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COMPOSITIONS AND METHODS FOR THE TREATMENT OF 
BODY WEIGHT DISORDERS. INCLUDING OBESITY 

This is a continuation-in-part of Serial No. 
5 08/470,868, filed June 6, 1995, which is a 

continuation-in-part of Serial No. 08/294,522, filed 
August 23, 1994, both applications of which are 
incorporated herein by reference in their entirety. 

10 INTRODUCTION 

The present invention relates to methods and 
compositions for the treatment of body weight 
disorders, including, but not limited to, obesity. 
Specifically, the present invention identifies and 

15 describes genes which are differentially expressed in 
body weight disorder states, relative to their 
expression in normal, or non-body weight disorder 
states, and also identifies genes which are 
differentially expressed in response to manipulations 

20 relevant to appetite and/or weight regulation. 
Further, the present invention identifies and 
describes genes via the ability of their gene products 
to interact with gene products involved in body weight 
disorders and/or to interact with gene products which 

25 are relevant to appetite and/or body weight 

regulation. Still further, the present invention 
provides methods for the identification and 
therapeutic use of compounds as treatments of body 
weight disorders. Additionally, the present invention 

30 describes methods for the diagnostic evaluation and 
prognosis of various body weight disorders, and for 
the identification of subjects exhibiting a 
predisposition to such conditions. 

35 2 . BACKGROUND OF THE INVENTION 

Body weight disorders, including eating 
disorders, represent major health problems in all 
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industrialized countries. Obesity, the most prevalent 
of eating disorders, for example, is the most 
important nutritional disorder in the western world, 
with estimates of its prevalence ranging from 3 0% to 
5 50% within the middle-aged population. Other body 

i 

weight disorders, such as anorexia nervosa and bulimia 
nervosa which together affect approximately 0.2% of 
the female population of the western world, also pose 
serious health threats. Further, such disorders as 
10 anorexia and cachexia (wasting) are also prominent 
features of other diseases such as cancer, cystic 
fibrosis, and AIDS. 

Obesity, defined as an excess of body fat 
relative to lean body mass, also contributes to other 

15 diseases. For example, this disorder is responsible 
for increased incidences of diseases such as coronary 
artery disease, stroke, and diabetes. Obesity is not 
merely a behavioral problem, i.e., the result of 
voluntary hyperphagia. Rather, the differential body 

20 composition observed between obese and normal subjects 
results from differences in both metabolism and 
neurologic /metabolic interactions . These differences 
seem to be, to some extent, due to differences in gene 
expression, and/or level of gene products or activity. 

25 The nature, however, of the genetic factors which 

control body composition are unknown, and attempts to 
identify molecules involved in such control have 
generally been empiric and the parameters of body 
composition and/or substrate flux are monitored have 

30 not yet been identified (Friedman, J.M. et al . , 1991, 
Mammalian Gene 1:130-144) . 

The epidemiology of obesity strongly shows that 
the disorder exhibits inherited characteristics, 
(Stunkard, 1990, N. Eng. J. Med. 322:1483) . Moll et 

35 al . , have reported that, in many populations, obesity 
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10 



15 



seems to be controlled by a few genetic loci, (Moll et 
al. 1991, Am. J. Hum. Gen. 41:1243). in addition, 
human twin studies strongly suggest a substantial' 
genetic basis in the control of body weight, with 
estimates of heritability of 80-90% (Simopoulos, a. P. 
& Childs B., eds., 1989, in "Genetic Variation and 
Nutrition in Obesity", World Review of Nutrition and 
Diabetes 63, S. Karger, Basel, Switzerland; Borjeson, 
M., 1976, Acta. Paediatr. Scand. 65_: 279-287) . 

Further, studies of non-obese persons who 
deliberately attempted to gain weight by 
systematically over- eating were found to be more 
resistant to such weight gain and able to maintain an 
elevated weight only by very high caloric intake. In 
contrast, spontaneously obese individuals are able to 
maintain their status with normal or only moderately 
elevated caloric intake. 

In addition, it is a commonplace experience in 
animal husbandry that different strains of swine, 
20 cattle, etc., have different predispositions to 

obesity. Studies of the genetics of human obesity and 
of models of animal obesity demonstrate that obesity 
results from complex defective regulation of both food 
intake, food induced energy expenditure and of the 
balance between lipid and lean body anabolism. 

There are a number of genetic diseases in man and 
other species which feature obesity among their more 
prominent symptoms, along with, frequently, dysmorphic 
features and mental retardation. Although no 
mammalian gene associated with an obesity syndrome has 
yet been characterized in molecular terms, a number of 
such diseases exist in humans. For example, Prader- 
Willi syndrome (PWS) affects approximately l in 20,000 
live births, and involves poor neonatal muscle tone, 
facial and genital deformities, and generally obesity. 
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The genetics of PWS are very complex, involving, for 
example, genetic imprinting, in which development of 
the disease seems to depend upon which parent 
contributes the abnormal PWS allele. In approximately 
5 half of all PWS patients, however, a deletion on the 
long arm of chromosome 11 is visible, making the 
imprinting aspect of the disease difficult to 
reconcile. Given the various symptoms generated, it 
seems likely that the PWS gene product may be required 
10 for normal brain function, and may, therefore, not be 
directly involved in adipose tissue metabolism. 

In addition to PWS, many other pleiotropic 
syndromes which include obesity as a symptom have been 
characterized. These syndromes are more genetically 
15 straightforward, and appear to involve autosomal 

recessive alleles. The diseases, which include, among 
others, Ahlstroem, Carpenter, Bardet -Biedl , Cohen, and 
Morgagni- Stewart -Monel Syndromes. 

Animals having mutations which lead to syndromes 
2 0 that include obesity symptoms have also been 

identified. Attempts have been made to utilize such 
animals as models for the study of obesity. The best 
studied animal models for genetic obesity are mice 
which contain the autosomal recessive mutations ob/ob 
25 ( obese ) and db/db ( diabetes ) . These mutations are on 
chromosomes 6 and 4, respectively, but lead to 
clinically similar pictures of obesity, evident 
starting at about 1 month of age, which include 
hyperphagia, severe abnormalities in glucose and 
30 insulin metabolism, very poor thermo-regulation and 
non- shivering thermogenesis , and extreme torpor and 
underdevelopment of the lean body mass. Restriction 
of the diet of these animals to restore a more normal 
body fat mass to lean body mass ration is fatal and 
35 does not result in a normal habitus. 
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Although the phenotypes of db/db and ob/ob mice 
are similar, the lesions are distinguishable by means 
of parabiosis. The feeding of normal mice and, 
putatively, all mammals, is regulated by satiety 
5 factors. The ob/ob mice are apparently unable to 

express the satiety factor, while the db/db mouse is 
unresponsive to it . 

In addition to ob and db, several other single 
gene mutations resulting in obesity in mice have been 

10 identified. These include the yellow mutation at the 
aqouti locus, which causes a pleiotropic syndrome 
which causes moderate adult onset obesity, a yellow 
coat color, and a high incidence of tumor formation 
(Herberg, L. . and Coleman, D.L., 1977, Metabolism 

15 26.:59), and an abnormal anatomic distribution of body 
fat (Coleman, D.L., 1978, Diabetologia 14. : 14 1 - 14 8 ) . 
Additionally, mutations at the fat and tubby loci 
cause moderately severe, maturity-onset obesity with 
somewhat milder abnormalities in glucose homeostasis 

20 than are observed in ob and db mice (Coleman, D.L., 
and Eicher, E.M., 1990, J . Heredity £1:424-427). 
Further, autosomal dominant mutations a: the adipose 
locus of chromosome 7, have been shown to cause 
obesity . 

25 Other animal models include f a / i a j i ; t y rats, 

which bear many similarities to the cb- it ar.J <ji db 
mice, discussed above. One difference is that, while 
f a / f a rats are very sensitive to cold, their capacity 
for non-shivering thermogenesis is normal. Torpor 

30 seems to play a larger part in the maintenance of 

obesity in fa/fa rats than in the mice mutants. In 
addition, inbred mouse strains such as NZO mice and 
Japanese KK mice are moderately obese. Certain hybrid 
mice, such as the Wellesley mouse, become 

35 spontaneously fat. Further, several desert rodents, 
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such as the spiny mouse, do not become obese in their 
natural habitats, but do become so when fed on 
standard laboratory feed. 

Animals which have been used as models for 
5 obesity have also been developed via physical or 
pharmacological methods. For example, bilateral 
lesions in the ventromedial hypothalamus (VMH) and 
ventrolateral hypothalamus (VLH) in the rat are 
associated, respectively, with hyperphagia and gross 
10 obesity and with aphagia and cachexia. Further, it has 
been demonstrated that feeding monosodium-glutamate 
(MSG) to new born mice also results in an obesity 
syndrome . 

Attempts have been made to utilize such animal 

15 models in the study molecular causes of obesity. For 
example, adipsin, a murine serine protease with 
activity closely similar to human complement factor D, 
produced by adipocytes, has been found to be 
suppressed in ob/ob . db/db and MSG- induced obesity 

20 (Flier, 1987, Science 237 ; 405 ) . The suppression of 
adipsin precedes the onset of obesity in each model 
(Lowell, 1990, Endocrinology 126 : 1514 ) . Further 
studies have mapped the locus of the defect in these 
models to activity of the adipsin promoter (Piatt, 

25 1989, Proc. Natl. Acad. Sci . USA 86 : 7490) . Further, 
alterations have been found in the expression of 
neuro- transmitter peptides in the hypothalamus of the 
ob/ob mouse (Wilding, 1993, Endocrinology 1^32:1939), 
of glucose transporter proteins in islet /3-cells 

30 (Ohneda, 1993, Diabetes 42:1065) and of the levels of 
G-proteins (McFarlane-Anderson, 1992, Biochem. J. 
282 : 15) . 

To date however, no gene, in humans, has been 
found which is causative in the processes leading to 
35 obesity. Given the severity and prevalence, however, 
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of disorders, including obesity, which affect body 
weight and body composition, there exists a great need 
for the systematic identification of such body weight 
disorder-causing genes. 

5 

3 . SUMMARY OF THE INVENTION 
The present invention relates to methods and 
compositions for the treatment of body weight 
disorders, including, but not limited to, obesity. 

10 Specifically, the present invention identifies and 

describes genes which are differentially expressed in 
body weight disorder states, relative to their 
expression in normal, or non-body weight disorder 
states, and also identifies genes which are 

15 differentially expressed in response to manipulations 
relevant to appetite and/or body weight regulation. 
Such differentially expressed genes may represent 
"target genes" and/or "fingerprint genes". Further, 
the present invention identifies and describes genes, 

20 termed "pathway genes", via the ability of their gene 
products to interact with gene products involved in 
body weight disorders and/or to interact with gene 
products which are relevant to appetite and body 
weight regulation. Pathway genes may also exhibit 

25 target gene and/or fingerprint gene characteristics. 

"Differential expression", as used herein, refers 
to both quantitative as well as qualitative 
differences in the genes' temporal and/or tissue 
expression patterns. "Fingerprint gene," as used 

30 herein, refers to a differentially expressed gene 

whose expression pattern may be utilized as part of a 
prognostic or diagnostic body weight disorder 
evaluation, or which, alternatively, may be used in 
methods for identifying compounds useful for the 

35 treatment of body weight disorders. "Target gene", as 
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used herein, refers to a differentially expressed gene 
involved in body weight disorders and/or appetite or 
body regulation such that modulation of the level of 
target gene expression or of target gene product 
5 activity may act to ameliorate symptoms of body weight 
disorders including, but are not limited to, obesity. 

This invention is based, in part on systematic, 
search strategies involving body weight disorder 
experimental paradigms coupled with sensitive gene 

10 expression assays. 

The present invention also describes the products 
of such fingerprint, target, and pathway genes, 
describes antibodies to such gene products, and still 
further describes cell- and animal-based models of 

15 body weight disorders to which such gene products may 
contribute . 

The invention further provides methods for the 
identification of compounds which modulate the 
expression of genes or the activity of gene products 

20 involved in body weight disprders and processes 

relevant to appetite and/or body weight regulation. 
Still further, the present invention describes methods 
for the treatment of body weight disorders which may 
involve the administration of such compounds to 

25 individuals exhibiting body weight disorder symptoms 
or tendencies. 

Additionally, the present invention describes 
methods for prognostic and diagnostic evaluation of 
various body weight disorders, and for the 

30 identification of subjects exhibiting a predisposition 
to such disorders. 

The Examples presented in Sections 6-10, below, 
demonstrate the successful use of the body weight 
disorder paradigms of the invention to identify body 

35 weight disorder target genes. 
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4 • DESCRIPTION OF TH E FTCjTTPPg 
FIG. l. Differential display comparing RNAs 
from liver tissue of lean and obese mice. Each group 
(1-11) of four lanes shows the pattern obtained with 
5 one primer pair combination, for a total of eleven 
different primer pair combinations. All lanes are 
products of a polymerase chain reaction (PCR) in which 
T tl GG was used as the 3' oligonucleotide and one of 
eleven different arbitrary lOmer oligonucleotides was 
10 used as the 5' oligonucleotide. Within each group of 
four lanes, the loading is as follows, from left to 
right: C57B1/6J lean control (marked "C"); C57B1/6J 
Qb/ob (marked »ob»); C57B1/KS lean control (marked 
"C") ; and C57B1/KS db/db (marked »db»). An arrow 
indicates a band (designated L36) that is differential 
between obese and lean samples amplified by the same 
primer pair, specifically, primer pair 6. 



15 



20 



FIG. 2. Northern blot analysis confirming 
differential regulation of a gene corresponding to 
band L36. Poly A* RNA (1/ig/lane) obtained from the 
original liver total RNA preparations was hybridized 
with a cDNA probe prepared by random priming of 
reamplified lane L36 (see materials and methods, 
25 below, in Section 6.1). Lane 1, C57B1/6J lean control 
("C") ; lane 2, C57B1/6J ob/ob (»ob»); lane 3 C57B1/Ks 
lean control ("C"); and lane 4, C57B1/Ks db/db (»db»). 

FIG. 3A. Consensus nucleotide sequence of L36 
3 0 amplified band (SEQ. ID NO: 1) . The IUPAC-IUB 

Standard Code is used (with the addition of the "X" 
designation, as shown below) in this nucleotide 
sequence and those nucleotide sequences listed in the 
figures which follow. Upper case letters refer to 
35 perfect consensus matches at a particular base pair 
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position. Lower case letters refer to base pair 
positions at which there was a less than perfect 
consensus match. Specifically, the code used was as 
follows : 



10 



15 



20 



>de 


Base 


Meanina 


A 


A 


Adenine 


c 


c 


Pvtos 1 np 


G 


G 


Guanine 


T 


T 


Thymine 


U 


U 


Uracil (RNA) 


R 


A or G 


Purine 


Y 


C or T (or U) 


Pyrimidine 


K 


G or T (or U) 


Keto 


M 


A or C 


Amino 


S 


G or C 


Strong 


W 


A or T (or U) 


Weak 


B 


C, G, T (or U) 


not A 


D 


G, A, T (or U) 


not C 


H 


A, C, T (or U) 


not G 


V 


A, C, or G 


not T 


N 


A, C, G, T 


Any 




(or U) 




X 


A , C, G, T 


Any or none 




(or U) , or none 





25 



30 



FIG. 3B . Alignment of L36 consensus nucleotide 
sequence with a mouse stearoyl-CoA desaturase 
nucleotide sequence (SEQ. ID NOS : 2 , 3). It will be 
noted that there are two alignments listed for this 
Li36/mouse stearoyl-CoA desaturase match. Each 
represents a highly statistically significant 
alignment, and together, these alignments represent 
very highly significant matches. This is the case for 
each of the matches listed in the figures, below, 
which have greater than one alignment listed. 



35 
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FIG. 4. Alignment of P3 consensus nucleotide 
sequence (SEQ ID NO.:4, 39) with a mouse glutamine 
synthetase nucleotide sequence (SEQ ID NOS.:5, 6). 

FIG. 5. Alignment of P13 consensus nucleotide 
sequence (SEQ ID NO.:7, 40, 41, 42) with a mouse islet 
regenerating protein nucleotide sequence (SEQ NOS.:8 r 
9, 10, 11). 



10 FIG. 6. Alignment 

sequence (SEQ ID NO.: 12) 
nucleotide sequence (SEQ 



of F5 consensus nucleotide 
with a mouse alpha-amylase 
ID NO. :13) . 



FIG. 7. Alignment of murine C5 consensus 
nucleotide sequence (SEQ ID NO.: 14) with a rabbit 
uncoupling protein nucleotide sequence (SEQ ID 
NO. : 15) . 



FIG. 8A-8B. Alignment of L31/F74 consensus 
nucleotide sequence (SEQ ID NO.:16) with a mouse major 
urinary protein II nucleotide sequence (SEQ ID 
NO. : 17) . 



FIG. 9. Alignment of L7/L21 consensus 
nucleotide sequence (SEQ ID NO.: 18) with a mouse 
cytochrome oxidase c subunit I nucleotide sequence 
(SEQ ID NO. :19) . 

FIG. 10. Alignment of L29 consensus nucleotide 
sequence (SEQ ID NO.:20) with a mouse testosterone 15- 
alpha hydroxylase nucleotide sequence (SEQ ID NO.:21). 

FIG. 11. Alignment of L38 consensus nucleotide 
sequence (SEQ ID NO.:22, 43) with a mouse 24p3 (a 



lipocalin family member of unknown function) 
nucleotide sequence (SEQ ID NO.:23, 24). 

FIG. 12. Alignment of L3 7 consensus nucleotide 
5 sequence (SEQ ID NO.:25, 44, 45) with a mouse p6-5 (a 
mouse sequence 86% homologous to rat preproelastase I) 
nucleotide sequence (SEQ ID NO.:26, 27, 28). 

FIG. 13. Alignment of L57 consensus nucleotide 
10 sequence (SEQ ID N0.:29, 46, 47, 48) with a mouse 

orphan hormone receptor nucleotide sequence (SEQ ID 
NO. :30, 31, 32, 33) . 

FIG. 14. Full length F49 cDNA clone. A cDNA 
putatively encoding the full length coding sequence of 
the F4 9 gene was isolated and its nucleotide sequence 
is listed herein (SEQ ID N0.:34). The F49 coding 
sequence encodes a 96 amino acid protein whose 
sequence is also listed herein (SEQ ID NO.:35). The 
initiating methionine codon and the termination codon 
are boxed . 

FIG. 15. Hydropathy plot of the F4 9 gene 
product. 

FIG. 16A-16B. Depicted herein are, first, the 
full length mouse C5 nucleotide sequence (SEQ ID 
NO.: 36), and second, the amino acid sequence (SEQ ID 
N0.:37) encoded by the mouse C5 gene. The initiating 
methionine and the termination codon are boxed. 

FIG. 17A-17B. Human C5 cDNA. The bottom line of 
the Figure depicts the nucleotide sequence of a human 
C5 cDNA clone (SEQ ID NO.:38); .the top line of the 
35 Figure depicts the portion of the human C5 amino acid 



15 



20 



25 



30 



- 12 - 

SUBSTITUTE SHEET (RULE 26) 



WO 96/05861 



PCT/US95/10918 



sequence (SEQ ID NO: 56) derived from the cDNA 
nucleotide sequence. 

FIG. 18. Full length C5 amino acid sequence (SEQ 
5 ID NO: 51) . 

FIG. 19. Northern analysis of tissue 
distribution of human C5 mRNA. 

10 FIG - 20 • Alignment of H27 consensus nucleotide 

sequence (SEQ ID NO: 49) with a mouse autoantigen La 
nucleotide sequence (SEQ ID NO: 50) . 
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20 



FIG. 21. Alignment of F84 consensus nucleotide 
sequence (SEQ ID NO: 52, 54) with a mouse cytochrome 
p450 IID nucleotide sequence (SEQ ID NO: 53, 55) 

FIG. 22. Mouse L34 cDNA nucleotide sequence (SEQ 
ID NO: 57) . 



5 • DETAILE D DESCRIPTION OF THE TMVENTION 
Methods and compositions for the treatment of 
body weight disorders, including, but not limited to, 
obesity. Genes, termed "target genes" and/or 
?5 "fingerprint genes", are described which are 

differentially expressed in body weight disorder 
states, relative to their expression in normal, or 
non-body weight disorder states, and/or which are 
differentially expressed in response to manipulations 

0 relevant to appetite and/or body weight regulation. 
Additionally, genes, termed "pathway genes", are 
described whose gene products exhibit an ability to 
interact with gene products involved in body weight 
disorders and/or with gene products which are relevant 

5 to appetite and/or body weight regulation. Methods 
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for the identification of such fingerprint, target, 

and pathway genes are also described. 

Further, the gene products of such fingerprint, 

target, and pathway genes are described, antibodies to 
5 such gene products are described, as are cell- and 

animal -based models of body weight disorders to which 

such gene products may contribute. 

Described, below, are methods for the 

identification of compounds which modulate the 
10 expression of genes or the activity of gene products 

involved in body weight disorders and processes 

relevant to appetite and/or body weight regulation. 

Additionally described, below, are methods for the 

treatment of body weight disorders. 
15 Also discussed, below, are methods for prognostic 

and diagnostic evaluation of various body weight 

disorders, and for the identification of subjects 

exhibiting a predisposition to such disorders. 

20 

5.1. IDENTIFICATION OF DIFFERENTIALLY 
EXPRESSED AND PATHWAY GENES 

Described herein are methods for the 
identification of genes which are involved in body 

25 weight disorder states, and/or which are involved in 
appetite and body weight regulation. Such genes may 
represent genes which are differentially expressed in 
body weight disorder states relative to their 
expression in normal, or non-body weight disorder 

30 states. Further, such genes may represent genes which 
are differentially regulated in response to 
manipulations relevant to appetite and body weight 
regulation. Such differentially expressed genes may 
represent "target" and/or "fingerprint" genes. 

35 Methods for the identification of such differentially 
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expressed genes are described, below, in Section 
5.1.1. Methods for the further characterization of 
such differentially expressed genes, and for their 
identification as target and/or fingerprint genes, are 
5 presented, below, in Section 5.1.3. 

In addition, methods are described herein, in 
Section 5.1.2, for the identification of genes, termed 
"pathway genes", involved in body weight disorder 
states, and/or in appetite or body weight regulation. 
10 "Pathway gene", as used herein, refers to a gene whose 
gene product exhibits the ability to interact with 
gene products involved in body weight disorders and/or 
to interact with gene products which are relevant to 
appetite or body weight regulation. A pathway gene 
may be differentially expressed and, therefore, may 
have the characteristics of a target and/or 
fingerprint gene. 

"Differential expression" as used herein refers 
to both quantitative as well as qualitative 
differences in the genes' temporal and/or tissue 
expression patterns. Thus, a differentially cxrfrMd 
gene may qualitatively have its express—.-, a:: 
or completely inactivated in normal verr^« t ; Jv . 
disorder states, or under control verr..; ' xr . 
25 conditions. Such a qualitatively re?-...-. . : : . ... _ 

exhibit an expression pattern within .. : ... .... cr 

cell type which is detectable in either ccr.tri. it 
body weight disorder subjects, but is not detectable 
in both. Alternatively, such a qualitatively 
regulated gene will exhibit an expression pattern 
within a given tissue or cell type which is detectable 
in either control or experimental subjects, but is not 
detectable in both. "Detectable", as used herein, 
refers to an RNA expression pattern which is 
35 detectable via the standard techniques of differential 
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display, RT-PCR and/or Northern analyses, which are 
well known to those of skill in the art. 

Alternatively, a differentially expressed gene 
may have its expression modulated, i.e. . 
5 quantitatively increased or decreased, in normal 

versus body weight disorder states, or under control 
versus experimental conditions. The degree to which 
expression differs in normal versus body weight 
disorder or control versus experimental states need 

10 only be large enough to be visualized via standard 

characterization techniques, such as, for example, the 
differential display technique described below. Other 
such standard characterization techniques by which 
expression differences may be visualized include but 

15 are not limited to, quantitative RT (reverse 
transcriptase) PCR and Northern analyses. 

Differentially expressed genes may be further 
described as target genes and/or fingerprint genes. 
"Fingerprint gene, " as used herein, refers to a 

20 differentially expressed gene whose expression pattern 
may be utilized as part of a prognostic or diagnostic 
body weight disorder evaluation, or which, 
alternatively, may be used in methods for identifying 
compounds useful for the treatment of body weight 

25 disorders. A fingerprint gene may also have the 

characteristics of a target gene or a pathway gene. 

"Target gene", as used herein, refers to a 
differentially expressed gene involved in body weight 
disorders and/or appetite or body regulation in a 

3 0 manner by which modulation of the level of target gene 
expression or of target gene product activity may act 
to ameliorate symptoms of body weight disorders 
including, but are not limited to, obesity. A target 
gene may also have the characteristics of a 

35 fingerprint gene and/or a pathway gene. 
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5.1.1. METHODS FOR THE IDENTIFICATION OF 
DIFFERENTIALLY EXPRESSED GENES 

A variety of methods may be utilized for the 

identification of genes which are involved in body 

weight disorder states, and/or which are involved in 

appetite and body weight regulation. Described in 

Section 5.1.1.1 are several experimental paradigms 

which may be utilized for the generation of subjects 

and samples which may be used for the identification 

of such genes. Material from the paradigm control and 

experimental subjects may be characterized for the 

presence of differentially expressed gene sequences as 

discussed, below, in Section 5.1.1.2. 

5.1.1.1. PARADIGMS FOR THE IDENTIFICATION 
OF DIF FERENTIALLY EXPRESSED GENFS 
Among the paradigms which may be utilized for the 
identification of differentially expressed genes 
involved in, for example, body weight disorders, are 
paradigms designed to analyze those genes which may be 
involved in short term appetite control. Accordingly, 
such paradigms are referred to as "short term appetite 
control paradigms." These paradigms may serve to 
identify genes involved in signalling hunger and 
satiety . 

In one embodiment of such a paradigm, test 
subjects, preferably mice, may be fed normally prior 
to the initiation of the paradigm study, then divided 
into one control and two experimental groups. The 
control group would then be maintained on ad lib 
nourishment, while the first experimental group 
("fasted group") would be fasted, and the second 
experimental group ( " f asted-ref ed group") would 
initially be fasted, and would then be offered a 
highly palatable meal shortly before the collection of 
tissue samples. Each test animal should be weighted 
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immediately prior to and immediately after the 
experiment. The Example presented in Section 7, 
below, demonstrates the use of such short term 
appetite paradigms to identify gene sequences which 
5 are differentially expressed in control versus fasting 
and versus refed animals. 

Among additional paradigms which may be utilized 
for the identification of differentially expressed 
genes involved in, for example, body weight disorders, 

10 are paradigms designed to analyze those genes which 
may be involved genetic obesity. Accordingly, such 
paradigms are referred to as "genetic obesity 
paradigms". In the case of mice, for example, such 
paradigms may identify genes regulated by the ob, db, 

15 and/or tub gene products. In the case of rats, for 
example, such paradigms may identify genes regulated 
by the fatty (fa) gene product. 

In one embodiment of such a paradigm, test 
subjects may include ob/ob , db/db . and/or tub/ tub 

20 experimental mice and lean littermate control animals. 
Such animals would be offered normal nourishment tcr a 
given period, after which tissue samples wr-Iri t- 
collected for analysis. The Examples prrscn:^ j : r. 
Sections 6 and 8, below, demonstrate t r.- ur- c i t ^rr. 

25 genetic obesity paradigms in identity;:.; 

sequences which are differentially express*- i : ese 

versus lean animals. 

In additional embodiments, ob/ob , db/db , and/or 
tub/tub experimental mice and lean control animals may 

30 be utilized as part of the short term appetite control 
paradigms discussed above, or as part of the set point 
and/or drug study paradigms discussed below. 

Paradigms which may be utilized for the 
identification of differentially expressed genes 

35 involved in body weight disorders may include 
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paradigms designed to identify those genes which may 
be regulated in response to changes in body weight. 
Such paradigms may be referred to as "set point 
paradigms" . 

In one embodiment of such a paradigm, test 
subjects, preferably mice, may be fed normally prior 
to the initiation of the paradigm study, then divided 
into one control and two experimental groups. The 
control group would then be maintained on an ad lib 
diet of normal nourishment in order to calculate daily 
food intake. The first experimental group 
("underweight group") would then be underfed by 
receiving some fraction of normal food intake, 60-90% 
of normal, for example, so as to reduce and maintain 
the group's body weight to some percentage, for 
example 80%, of the control group. The second 
experimental group ("overweight group") would be 
overfed by receiving a diet which would bring the 
group to some level above that of the control, for 
example 125% of the control group. Tissue samples 
would then be obtained for analysis. The Example 
presented in Section 9, below, demonstrates the use of 
such set point paradigms to identify gene sequences 
which are differentially expressed in control versus 
25 overweight and/or underweight conditions. 

Additionally, human subjects may be utilized for 
the identification of obesity-associated genes. In 
one embodiment of such a paradigm, tissue samples may 
be obtained from obese and lean human subjects and 
30 analyzed for the presence of genes which are 

differentially expressed in the tissue of one group as 
opposed to another ( e.g. differentially expressed in 
lean versus obese subjects) . In another embodiment, 
obese human subjects may be studied over the course of 
35 a period of weight loss, achieved through food 
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restriction. Tissue from these previously obese 
subjects may be analyzed for differential expression 
of gene products relative to tissue obtained from 
control (lean, non-previously obese) and obese 
5 subjects. 

Paradigms may be utilized for the identification 
of differentially expressed genes involved in body 
weight disorders may additionally include paradigms 
designed to identify genes associated with body weight 

10 disorders induced by some physical manipulation to the 
test ~ subj ect , such as, for example, hypothalamic 
lesion- induced body weight disorders. For example, 
bilateral lesions in the ventromedial hypothalamus 
(VMH) of rodents may be utilized to induce hyperphagia 

15 and gross obesity in test subjects, while bilateral 
lesions in the ventrolateral hypothalamus (VLH) of 
rodents may be utilized to induce aphagia in test 
subjects. In such paradigms, tissue from 
hypothalamic- lesioned test subjects and from control 

20 subjects would be analyzed for the identification of 
genes which are differentially expressed in control 
versus lesioned animals . 

Drugs known to affect ( e.g. , ameliorate) human or 
animal body weight and/or appetite (such as short term 

25 appetite) may be incorporated into paradigms designed 
to identify genes which are involved in body weight 
disorders and/or body weight or appetite regulation. 
Accordingly, such paradigms are referred to as "drug 
study paradigms" . Such compounds may include known 

30 therapeutics, as well as compounds that are not useful 
as therapeutics due to, for example, their harmful 
side effects. Among the categories of control and 
test subjects which may be utilized in such paradigms 
are, for example, lean subjects, obese subjects, and 

35 obese subjects which have received the drug of 
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interest. in various embodiments of the paradigms, 
subjects such as these may be fed a normal ad lib 
diet, a caloric restriction maintained diet, or a 
caloric restriction ad lib diet. Control and test 
5 subjects may additionally be pairfed i.e. . the control 
and test subjects may be fed via a coupled feeding 
device such that both control and test subjects 
receive identical amounts and types of food) . 

10 5.1.1.2. ANALYSIS OF PARADT QM MATRRTftT, 

In order to identify differentially expressed 
genes, RNA, either total or mRNA, may be isolated from 
one or more tissues of the subjects utilized in 
paradigms such as those described, above, in Section 
15 5.1.1. RNA samples are obtained from tissues of 

experimental subjects and from corresponding tissues 
of control subjects. Any RNA isolation technique 
which does not select against the isolation of mRNA 
may be utilized for the purification of such RNA 
20 samples. See, for example, Ausubel , F.M. et al , 
eds., 1987-1993, Current Protocols in Molpmlar 
Biology, John Wiley & Sons, Inc. New York, which is 
incorporated herein by reference in its entirety. 
Additionally, large numbers of tissue samples may 
readily be processed using techniques well known to 
those of skill in the art, such as, for example, the 
single-step RNA isolation process of Chomczynski, p. 
(1989, U.S. Patent No. 4,843,155), which is 
incorporated herein by reference in its entirety. 

Transcripts within the collected RNA samples 
which represent RNA produced by differentially 
expressed genes may be identified by utilizing a 
variety of methods which are well known to those of 
skill in the art. For example, differential screening 
35 (Tedder, T.F. et al., 1988, Proc . Natl. Acad. Sci . USA 
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8_5: 208-212) , subtractive hybridization (Hedrick, S.M. 
et al . . 1984, Nature 308 :149-153; Lee, S.W. et al . . 
1984, Proc. Natl. Acad. Sci . USA 88:2825), and, 
preferably, differential display (Liang, P. and 
5 Pardee, A.B., 1992, Science 257 : 967-971 ; U.S. Patent 
No. 5,262,311, which is incorporated herein by 
reference in its entirety) , may be utilized to 
identify nucleic acid sequences derived from genes 
that are differentially expressed. 
10 Differential screening involves the duplicate 

screening of a cDNA library in which one copy of the 
library is screened with a total cell cDNA probe 
corresponding to the mRNA population of one cell type 
while a duplicate copy of the cDNA library is screened 
15 with a total cDNA probe corresponding to the mRNA 

population of a second cell type. For example, one 
cDNA probe may correspond to a total cell cDNA probe 
of a cell type or tissue derived from a control 
subject, while the second cDNA probe may correspond to 
20 a total cell cDNA probe of the same cell type or 

tissue derived from an experimental subject. Those 
clones which hybridize to one probe but not to the 
other potentially represent clones derived from genes 
differentially expressed in the cell type of interest 
25 in control versus experimental subjects. 

Subtractive hybridization techniques generally 
involve the isolation of mRNA taken from two different 
sources, e.g. , control and experimental tissue or cell 
type, the hybridization of the mRNA or single- stranded 
30 cDNA reverse- transcribed from the isolated mRNA, and 
the removal of all hybridized, and therefore double- 
stranded, sequences. The remaining non-hybridized, 
single- stranded cDNAs , potentially represent clones 
derived from genes that are differentially expressed 
35 in the two mRNA sources. Such single-stranded cDNAs 
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are then used as the starting material for the 
construction of a library comprising clones derived 
from differentially expressed genes. 

The differential display technique describes a 
5 procedure, utilizing the well known polymerase chain 
reaction (PCR; the experimental embodiment set forth 
in Mullis, K.B., 1987, U.S. Patent No. 4,683,202) 
which allows for the identification of sequences 
derived from genes which are differentially expressed. 
10 First, isolated RNA is reverse- transcribed into 

single -stranded cDNA, utilizing standard techniques 
which are well known to those of skill in the art. 
Primers for the reverse transcriptase reaction may 
include, but are not limited to, oligo dT- containing 
primers, preferably of the 3' primer type of 
oligonucleotide described below. 

Next, this technique uses pairs of PCR primers, 
as described below, which allow for the amplification 
of clones representing a random subset of the RNA 
transcripts present within any given cell. Utilizing 
different pairs of primers allows each of the mRNA 
transcripts present in a cell to be amplified. Among 
such amplified transcripts may be identified those 
which have been produced from differentially expressed 
25 genes . 

The 3' oligonucleotide primer of the primer pairs 
may contain an oligo dT stretch of 10-13, preferably 
11, dT nucleotides at its 5' end, which hybridizes to 
the poly (A) tail of mRNA or to the complement of a 
cDNA reverse transcribed from an mRNA poly (A) tail. 
Second, in order to increase the specificity of the 3' 
primer, the primer may contain one or more, preferably 
two, additional nucleotides at its 3' end. Because, 
statistically, only a subset of the mRNA derived 
sequences present in the sample of interest will 
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hybridize to such primers, the additional nucleotides 
allow the primers to amplify only a subset of the mRNA 
derived sequences present in the sample of interest. 
This is preferred in that it allows more accurate and 
5 complete visualization and characterization of each of 
the bands representing amplified sequences. 

The 5' primer may contain a nucleotide sequence 
expected, statistically, to have the ability to 
hybridize to cDNA sequences derived from the tissues 
10 of interest. The nucleotide sequence may be an 
arbitrary one, and the length of the 5' 

oligonucleotide primer may range from about 9 to about 
15 nucleotides, with about 13 nucleotides being 
preferred . 

15 Arbitrary primer sequences cause the lengths of 

the amplified partial cDNAs produced to be variable, 
thus allowing different clones to be separated by 
using standard denaturing sequencing gel 
electrophoresis . 

20 PCR reaction conditions should be chosen which 

optimize amplified product yield and specificity, and, 
additionally, produce amplified products of lengths 
which may be resolved utilizing standard gel 
electrophoresis techniques. Such reaction conditions 

25 are well known to those of skill in the art, and 

important reaction parameters include, for example, 
length and nucleotide sequence of oligonucleotide 
primers as discussed above, and annealing and 
elongation step temperatures and reaction times. 

30 The pattern of clones resulting from the reverse 

transcription and amplification of the mRNA of two 
different cell types is displayed via sequencing gel 
electrophoresis and compared. Differentially 
expressed genes are indicated by differences in the 

3 5 two banding patterns. 
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Once potentially differentially expressed gene 
sequences have been identified via bulk techniques 
such as, for example, those described above, the 
differential expression of such putatively 
5 differentially expressed genes should be corroborated. 
Corroboration may be accomplished via, for example, 
such well known techniques as Northern analysis, 
quantitative RT PCR or RNase protection. 

Upon corroboration, the differentially expressed 
10 genes may be further characterized, and may be 

identified as target and/or fingerprint genes, as 
discussed, below, in Section 5.1.3. 

Also, amplified sequences of differentially 
expressed genes obtained through, for example, 
differential display may be used to isolate full 
length clones of the corresponding gene. The full 
length coding portion of the gene may readily be 
isolated, without undue experimentation, by molecular 
biological techniques well known in the art. For 
example, the isolated differentially expressed 
amplified fragment may be labeled and used to screen a 
cDNA library. Alternatively, the labeled fragment may 
be used to screen a genomic library. 

PCR technology may also be utilized to isolate 
2 5 full length cDNA sequences. As described, above, in 
this Section, the isolated, amplified gene fragments 
obtained through differential display have 5' terminal 
ends at some random point within the gene and usually 
have 3' terminal ends at a position corresponding to 
30 the 3' end of the transcribed portion of the gene. 

Once nucleotide sequence information from an amplified 
fragment is obtained, the remainder of the gene ( i.e. . 
the 5' end of the gene, when utilizing differential 
display) may be obtained using, for example, RT-PCR. 
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In one embodiment of such a procedure for the 
identification and cloning of full length gene 
sequences, RNA may be isolated, following standard 
procedures, from an appropriate tissue or cellular 
5 source. A reverse transcription reaction may then be 
performed on the RNA using an oligonucleotide primer 
complimentary to the mRNA that corresponds to the 
amplified fragment, for the priming of first strand 
synthesis. Because the primer is anti-parallel to the 

10 mRNA, extension will proceed toward the 5' end of the 
mRNA. The resulting RNA/DNA hybrid may then be 
"tailed" with guanines using a standard terminal 
transferase reaction, the hybrid may be digested with 
RNAase H, and second strand synthesis may then be 

15 primed with a poly-C primer. Using the two primers, 
the 5' portion of the gene is amplified using PCR. 
Sequences obtained may then be isolated and recombined 
with previously isolated sequences to generate a full- 
length cDNA of the differentially expressed genes of 

20 the invention. For a review of cloning strategies and 
recombinant DNA techniques, see e.g. , Sambrook et al . , 
198 9, Molecular Cloning, A Laboratory Manual, Cold 
Spring Harbor Press, N.Y.; and Ausubel et al. , 1989, 
Current Protocols in Molecular Biology, Green 

25 Publishing Associates and Wiley Int erscience , N.Y. 

5.1.2. METHODS FOR THE IDENTIFICATION OF PATHWAY 
GENES Methods are described herein for the 

identification of pathway genes. "Pathway gene", as 

3Q used herein, refers to a gene whose gene product 

exhibits the ability to interact with gene products 

involved in body weight disorders and/or to interact 

with gene products which are relevant to appetite or 

body weight regulation. A pathway gene may be 

differentially expressed and, therefore, may have the 

characteristics of a target and/or fingerprint gene. 
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Any method suitable for detecting protein-protein 
interactions may be employed for identifying pathway 
gene products by identifying interactions between gene 
products and gene products known to be involved in 
5 body weight disorders and/or involved in appetite or 
body regulation. Such known gene products may be 
cellular or extracellular proteins. Those gene 
products which interact with such known gene products 
represent pathway gene products and the genes which 
10 encode them represent pathway genes. 

Among the traditional methods which may be 
employed are co- immunoprecipi tat ion , crosslinking and 
co-purification through gradients or chromatographic 
columns. Utilizing procedures such as these allows 
15 for the identification of pathway gene products. Once 
identified, a pathway gene product may be used, in 
conjunction with standard techniques, to identify its 
corresponding pathway gene. For example, at least a 
portion of the amino acid sequence of the pathway gene 
20 product may be ascertained using techniques well known 
to those of skill in the art, such as via the Edman 
degradation technique (see, e.g. . Creighton, 1983, 
"Proteins: Structures and Molecular Principles", W.H. 
Freeman & Co., N.Y., pp. 34-49). The amino acid 
25 sequence obtained may be used as a guide for the 

generation of oligonucleotide mixtures that can be 
used to screen for pathway gene sequences. Screening 
made be accomplished, for example, by standard 
hybridization or PCR techniques. Techniques for the 
generation of oligonucleotide mixtures and the 
screening are well-known. (See, e.g. . Ausubel, 
su P ra : * a nd PCR Protocols: A. Guide to Methods and 
Applications, 1990, Innis, M. et al . , eds . Academic 
Press, Inc., New York). 
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Additionally, methods may be employed which 
result in the simultaneous identification of pathway 
genes which encode the protein interacting with a 
protein involved in body weight disorder states and/or 
5 appetite and body weight regulation. These methods 
include, for example, probing expression libraries 
with labeled protein known or suggested to be involved 
in body weight disorders and/or appetite or body 
weight regulation, using this protein in a manner 
10 similar to the well known technique of antibody 
probing of Xgtll libraries. 

One method which detects protein interactions in 
vivo , the two-hybrid system, is described in detail 
for illustration only and not by way of limitation. 
15 One version of this system has been described (Chien 
et al., 1991, Proc . Natl. Acad. Sci . USA, BB : 9578- 
9582) and is commercially available from Clontech 
(Palo Alto, CA) . 

Briefly, utilizing such a system, plasmids are 
20 constructed that encode two hybrid proteins: one 

consists of the DNA-binding domain of a transcription 
activator protein fused to a known prote:n, ir. :h:c 
case, a protein known to be involved : r. t>c?y we ; 
disorders and or processes relevant t: p ^ : : - *• jr.-j cr 
25 weight regulation, and the other ccr.r ir-tr. :i t fi" 

transcription activator protein's a::;va* n:»-a:r. 
fused to an unknown protein that is encoded by <* cDNA 
which has been recombined into this plasmid as part of 
a cDNA library. The plasmids are transformed into a 
30 strain of the yeast Saccharomvces cerevisiae that 

contains a reporter gene (e.g., lacZ) whose regulatory 
region contains the transcription activator's binding 
sites. Either hybrid protein alone cannot activate 
transcription of the reporter gene: the DNA-binding 
3 5 domain hybrid cannot because it does not provide 
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activation function and the activation domain hybrid 
cannot because it cannot localize to the activator's 
binding sites. Interaction of the two hybrid proteins 
reconstitutes the functional activator protein and 
5 results in expression of the reporter gene, which is 
detected by an assay for the reporter gene product. 

The two-hybrid system or related methodology may 
be used to screen activation domain libraries for 
proteins that interact with a known "bait" gene 
10 product. By way of example, and not by way of 

limitation, gene products known to be involved in body 
weight disorders and/or appetite or body weight 
regulation may be used as the bait gene products. 
These include but are not limited to the intracellular 
domain of receptors for such hormones as neuropeptide 
Y, galanin, interostatin, insulin, and CCK. Total 
genomic or cDNA sequences are fused to the DNA 
encoding an activation domain. This library and a 
plasmid encoding a hybrid of the bait gene product 
fused to the DNA-binding domain are cotransf ormed into 
a yeast reporter strain, and the resulting 
transformants are screened for those that express the 
reporter gene. For example, and not by way of 
limitation, the bait gene can be cloned into a vector 
such that it is translat ionally fused to the DNA 
encoding the DNA-binding domain of the GAL 4 protein. 
These colonies are purified and the library plasmids 
responsible for reporter gene expression are isolated. 
DNA sequencing is then used to identify the proteins 
3 0 encoded by the library plasmids. 

A cDNA library of the cell line from which 
proteins that interact with bait gene product are to 
be detected can be made using methods routinely 
practiced in the art. According to the particular 
system described herein, for example, the cDNA 
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fragments can be inserted into a vector such that they 
are translationally fused to the activation domain of 
GAL4 . This library can be co- transformed along with 
the bait gene-GAL4 fusion plasmid into a yeast strain 
which contains a lacZ gene driven by a promoter which 
contains GAL4 activation sequence- A cDNA encoded 
protein, fused to GAL4 activation domain, that 
interacts with bait gene product will reconstitute an 
active GAL 4 protein and thereby drive expression of 
the lacZ gene. Colonies which express lacZ can be 
detected by their blue color in the presence of X-gal. 
The cDNA can then be purified from these strains, and 
used to produce and isolate the bait gene- interact ing 
protein using techniques routinely practiced in the 
art . 

Once a pathway gene has been identified and 
isolated, it may be further characterized as, for 
example, discussed below, in Section 5.1.3. 

5.1.3. CHARACTER I Z AT I ON OF DIFFERENTIALLY 
EXPRESSED AND PATHWAY GENES 

Differentially expressed genes, such as those 
identified via the methods discussed, above, in 
Section 5.1.1, and pathway genes, such as those 
identified via the methods discussed, above, in 
Section 5.1.2, above, as well as genes identified by 
alternative means, may be further characterized by 
utilizing, for example, methods such as those 
discussed herein. Such genes will be referred to 
herein as "identified genes" . 

Analyses such as those described herein, yield 
information regarding the biological function of the 
identified genes. An assessment of the biological 
function of the differentially expressed genes, in 
addition, will allow for their designation as target 
and/or fingerprint genes. 
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Specifically, any of the differentially expressed 
genes whose further characterization indicates that a 
modulation of the gene's expression or a modulation of 
the gene product's activity may ameliorate any of the 
5 body weight disorders of interest will be designated 
"target genes", as defined, above, in Section 5.1. 
Such target genes and target gene products, along with 
those discussed below, will constitute the focus of 
the compound discovery strategies discussed, below, in 

10 Section 5.3. Further, such target genes, target gene 
products and/or modulating compounds can be used as 
part of the body weight disorder treatment methods 
described, below, in Section 5.4. 

Any of the differentially expressed genes whose 

15 further characterization indicates that such 

modulations may not positively affect body weight 
disorders of interest, but whose expression pattern 
contributes to a gene expression "fingerprint" pattern 
correlative of, for example, a body weight disorder 

20 state will be designated a "fingerprint gene". 

"Fingerprint patterns" will be more fully discussed, 
below, in Section 5.7.1. it should be noted that each 
of the target genes may also function as fingerprint 
genes, as well as may all or a portion of the pathway 

2 5 genes. 

It should further be noted that the pathway genes 
may also be characterized according to techniques such 
as those described herein. Those pathway genes which 
yield information indicating that they are 

30 differentially expressed and that modulation of the 
gene's expression or a modulation of the gene 
product's activity may ameliorate any of the body 
weight disorders of interest will be also be 
designated "target genes". Such target genes and 

35 target gene products, along with those discussed 
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above, will constitute the focus of the compound 
discovery strategies discussed, below, in Section 5.3 
and can be used as part of the treatment methods 
described in Section 5.4, below. 
5 It should be additionally noted that the 

characterization of one or more of the pathway genes 
may reveal a lack of differential expression, but 
evidence that modulation of the gene's activity or 
expression may, nonetheless, ameliorate body weight 
10 disorder symptoms. In such cases, these genes and 

gene products would also be considered a focus of the 
compound discovery strategies of Section 5.3, below. 

In instances wherein a pathway gene's 
characterization indicates that modulation of gene 
15 expression or gene product activity may not positively 
affect body weight disorders of interest, but whose 
expression is differentially expressed and 
contributes to a gene expression fingerprint pattern 
correlative of, for example, a body weight disorder 
20 state, such pathway genes may additionally be 
designated as fingerprint genes. 

A variety of techniques can be utilized to 
further characterize the identified genes. First, the 
nucleotide sequence of the identified genes, which may 
25 be obtained by utilizing standard techniques well 

known to those of skill in the art, may, for example, 
be used to reveal homologies to one or more known 
sequence motifs which may yield information regarding 
the biological function of the identified gene 
3 0 product . 

Second, an analysis of the tissue and/or cell 
type distribution of the mRNA produced by the 
identified genes may be conducted, utilizing standard 
techniques well known to those of skill in the art. 
35 Such techniques may include, for example, Northern, 



- 32 - 



WO 96/05861 



PCT/US95/10918 



RNase protection and RT-PCR analyses. Such analyses 
provide information as to, for example, whether the 
identified genes are expressed in tissues or cell 
types expected to contribute to the body weight 
5 disorders of interest. Such analyses may also provide 
quantitative information regarding steady state mRNA 
regulation, yielding data concerning which of the 
identified genes exhibits a high level of regulation 
in, preferably, tissues which may be expected to 

10 contribute to the body weight disorders of interest. 
Additionally, standard in situ hybridization 
techniques may be utilized to provide information 
regarding which cells within a given tissue express 
the identified gene. Such an analysis may provide 

15 information regarding the biological function of an 
identified gene relative to a given body weight 
disorder in instances wherein only a subset of the 
cells within the tissue is thought to be relevant to 
the body weight disorder. 

20 Third, the sequences of the identified genes may 

be used, utilizing standard techniques, to place the 
genes onto genetic maps, e.g. . mouse (Cope land. N.G. 
and Jenkins, N.A., 1991, Trends in Genetics 7 : l : :* . : i e ; 
and human genetic maps (Cohen, D. , et al . . 1993. 

25 Nature 3.66:698-701). Such mapping ir.f crraticr. may 

yield information regarding the genes' :-prrt ar.c to 
human disease by, for example, identifying genes which 
map within genetic regions to which known genetic body 
weight disorders map. 

30 Fourth, the biological function of the identified 

genes may be more directly assessed by utilizing 
relevant in vivo and in vitro systems. In vivo 
systems may include, but are not limited to, animal 
systems which naturally exhibit body weight disorder- 

3 5 like symptoms, or ones which have been engineered to 
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exhibit such symptoms. Further, such systems may 
include systems for the further characterization of 
body weight disorders, and/or appetite or body weight 
regulation, and may include, but are not limited to, 
5 naturally occurring and transgenic animal systems such 
as those described, above, in Section 5.1.1.1, and 
Section 5.2.4.1, below. In vitro systems may include, 
but are not limited to, cell-based systems comprising 
cell types known or suspected of contributing to the 

10 body weight disorder of interest. Such cells may be 
wild type cells, or may be non-wild type cells 
containing modifications known to, or suspected of, 
contributing to the body weight disorder of interest. 
Such systems are discussed in detail, below, in 

15 Section 5.2.4.2. 

In further characterizing the biological function 
of the identified genes, the expression of these genes 
may be modulated within the in vivo and/or in vitro 
systems, i.e. , either overexpressed or underexpressed 

20 in, for example, transgenic animals and/or cell lines, 
and its subsequent effect on the system then assayed. 
Alternatively, the activity of the product of the 
identified gene may be modulated by either increasing 
or decreasing the level of activity in the in vivo 

25 and/or in vitro system of interest, and its subsequent 
effect then assayed. 

The information obtained through such 
characterizations may suggest relevant methods for the 
treatment of body weight disorders involving the gene 

30 of interest. Further, relevant methods for the 
control of appetite and body weight regulation 
involving the gene of interest may be suggested by 
information obtained from such characterizations. For 
example, treatment may include a modulation of gene 

35 expression and/or gene product activity. 
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Characterization procedures such as those described 
herein may indicate where such modulation should 
involve an incr ase or a decrease in the expression or 
activity of the gene or gene product of interest. 
5 Such methods of treatment are discussed, below, in 
Section 5.4. 

5.2. DIFFERENTIALLY EXPRESSED AND PATHWAY GKfrR fi 

Identified genes, which include, but are not 
limited to, differentially expressed genes such as 
those identified in Section 5.1.1, above, and pathway 
genes, such as those identified in Section 5.1.2, 
above, are described herein. Specifically, the 
nucleic acid sequences and gene products of such 
identified genes are described. Further, antibodies 
directed against the identified genes' products, and 
cell- and animal-based models by which the identified 
genes may be further characterized and utilized are 
also discussed in this Section. 

5.2.1. DIFFERENTIALLY EXPRESSED GENE 
SEQUENCES 

Differentially expressed nucleotide sequences 
are shown in FIGS. 3A, 4-14, 16-17B and 20-21. Table 
1 lists differentially expressed genes (P3, P13, F5, 
F49, murine C5, human C5, L31/F74, L7/L21, L29, L38, 
L37, L57, H27, F84 and L34) identified through, for 
example, the paradigms discussed, above, in Section 
5.1.1.1, and, below, in the examples presented in 
Sections 6-10. Table 1 also summarizes information 
regarding the further characterization of such genes. 
Table 2 lists E. coli clones, deposited with the 
Agricultural Research Service Culture Collection 
(NRRL) , which contain sequences' found within the F4 9 
and human C5 genes listed in Table 1. 
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In Table 1, the differential expression patterns 
revealed via, for example, one or more of the paradigm 
conditions described in Section 5.1.1.1, above, are 
summarized under the column headed "Paradigm 
5 Expression Pattern". For each of the tested genes, 

the paradigm which was used and the difference in the 
expression of the gene in experimental versus control 
tissues is shown. "fl" indicates that gene expression 
is increased ( i.e. . there is an increase in the amount 
10 of detectable mRNA produced by a given gene) in 

experimental versus control tissue or cell type, while 
" li H indicates that gene expression is decreased ( i.e. , 
there is an decrease in the amount of detectable mRNA 
produced by a given gene) in experimental versus 
15 control tissue or cell type. Further, " + u indicates 
that gene expression is activated in experimental 
versus control tissue or cell type, i.e. , mRNA is 
detectable in experimental tissue or cell type whereas 
none is detectable in control tissue or cell type, 
20 while " - " would indicate that gene expression is 

inactivated in experimental versus control tissue or 
cell type, i.e. , while mRNA is detectable in control 
tissue or cell type, it is no longer detectable in 
experimental tissue or cell type. "Detectable" as 
25 used herein, refers to levels of mRNA which are 

detectable via standard differential display, Northern 
and/or RT-PCR techniques which are well known to those 
of skill in the art. " Increased" and "decreased", as 
used herein, refer to an increase or decrease, 
30 respectively in level of mRNA present in experimental 
versus control tissue or cell type which is detectable 
via standard differential display, Northern, and/or 
RT-PCT techniques which are well known to those of 
skill in the art. 
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Tissue expression patterns are also summarized in 
Table l. The column headed "First Detection" 
indicates the first tissue or cell type in which 
differential expression of the gene was detected. The 
column headed "Tissue/Cell Dist . » lists tissues and/or 
cell types in which expression of the gene has been 
tested and whether expression of the gene within a 
given tissue or cell type has been observed. 
Specifically, " + ■» indicates detectable mRNA from the 
gene of interest, while » - » refers to no detectable 
mRNA from the gene of interest . Unless otherwise 
noted, "+» and «-« refer to both control and 
experimental samples. "Detectable", as used herein, 
is as defined earlier in this Section. 
15 Additionally, the physical locus to which the 

gene maps on the human and/or mouse chromosome map is 
indicated in the column headed "Locus". Further, in 
instances wherein the genes correspond to genes known 
to be found in nucleic acid databases, references 
- e - r citations and/or gene names) to such known 
genes are listed in the column headed "Ref". 

The genes listed in Table 1 can be obtained using 
cloning methods well known to those of skill in the 
art, and which include, but are not limited to, the 
use of appropriate probes to detect the genes within 
an appropriate cDNA or gDNA (genomic DNA) library. 
(See, for example, Sambrook et al . , 1989, Molecular 
Cloning: A Laboratory Manual, Cold Spring Hanson 
Laboratories, which is incorporated herein by 
30 reference in its entirety.) Probes for the sequences 
reported herein can be obtained directly from the 
isolated clones deposited with the NRRL, as indicated 
in Table 2, below. Alternatively, oligonucleotide 
probes for the genes can be synthesized based on the 
35 DNA sequences disclosed herein in FIGs . 3A, 4-14, 16- 
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17B and 20-22. With respect to the previously 
reported genes, oligonucleotides can be synthesized or 
produced based on the sequences provided for the 
previously known genes described in the following 
5 references: glutamine synthetase (P3) : Bhandari et 
al., 1991, J. Biol. Chem. 266:7784-7792; islet 
regenerating protein (P13) : Unno, M. et al. , 1987, J. 
Biol. Chem. £68: 15974-15982 ; Terazono, K. et al. , 
1988, J. Biol. Chem. 263 :2111-2114; Watanabe, T. , 
10 1990, J. Biol. Chem. 265 :7432-7439; alpha amylase 

(F5) : Schibler, U. et al., 1986, in "Oxford Surveys on 
Eukaryotic Genes", Maclean, N. , ed. 3.:210, Oxford 
Univ. Press, New York; Schibler et al., 1982, J. Mol. 
Biol. 155:247-266; mouse major urinary protein II 
15 (L31/F74): Shahan, K. et al., 1987, Mol. Cell. Biol. 
1:1938-1946; mouse cytochrome C oxidase Subunit I 
(L7/L21) : Bibb et al., 1981, Cell 26:167-180; mouse 
testosterone 15-alpha hydroxylase (L15) : Squires, E.J. 
and Negism: , M. , 1988, J. Biol. Chem. 263 : 4166-4171 ; 
20 mouse 24p3 (L38) : Flower, D.R. et al., 1991, Biochem. 
Biophys. Res. Comm. 180 : 69-74 ; Hraba-Renevey , S. et 
al., 1989, Oncogene 4:601-608; mouse p6-5 (L37): 
Yamasaki, N. et al., 1987, Eur. J. Immunol. 17_:247- 
253; mouse orphan Nuclear hormone Reception (L57): 
25 Forman et al., 1994, Mol. Endocrinol. 81253-1261; 

autoantigen La (H27): Genbank Accession No. L00993; 
and mouse cytochrome p450 IID (F84) : Matsunaga, E. et 
al., 1990, J. Mol. Evol. 3.0 : 155 — 169 ; mouse L34 
(homolog of the human lrp 130 gene): Hou, J. et al., 
30 1994, In Vitro Cell. Dev. Biol. Anim. 30A: 111-114. 

The probes can be used to screen cDNA libraries 
prepared from an appropriate cell, cell line or tissue 
in which the gene is transcribed. Appropriate cell 
lines can include, for example, . preadipocyte cell 
35 lines such as 3T3-A1 and TA1 mouse preadipocyte cell 
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lines, liver cell lines, such as the Hepal-6 mouse 
liver cell line and the HepG2 human liver cell line. 
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TABLE 1 



Differentially Expressed Genes 



5 


Gene 


Paradigm 1 

Expression 

Pattern 


First 
Detection 


Tissue/ 
Cell 
Dist . 


Locus 


Ref 


10 


P3 (SEQ. ID 
NO : 4 , 3 9) 


fl Pancreas 
(fasted) 


Pancreas 






1 




P13 (SEQ . ID 
NO:7, 40-42) 


fl Pancreas 
(fasted) 


Pancreas 






2 




F5 (SEQ. ID 
NO: 12) 


ftAdipose 
(fasted) 


Adipose 






3 


15 
20 


F4 9 (SEQ. ID 
NO: 34) 


+ Adipose 
(db/db) 


Adipose 


Adipose 

[< + ) 
db/db • 

(-) lean 
control] 
Muscle 

(-) 

Small 
Intestin 
e (-) 
Hypothal 
amus ( - ) 


Chrom 

2 




25 








Liver ( - 
) 

Pancreas 
(-) 


















30 


murine C5 10 
(SEQ. ID NO: 
36) 


n Adipose 
(ob/ob and 
db/db) 


Adipose 









35 
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0 



5 



5 



Gene 


Paradigm 

Expression 

Pattern 


First 
Detection 


Tissue/ 
Cell 
Dist . 


Locus 


Ref 


L31/F74 
(SEQ. ID 
NO: 16) 


0 Liver and 
Adipose 
(ob/ob and 
db/db) 
0 Liver 

(underweight) 


Adipose ; 
Liver 


Liver 

< + ) 

Adipose 

< + ) 

Muscle 
(-) 




4 


L7/L21 (SEQ. 
ID NO: 18) 


fi Liver 
(fasted, 
ob/ob, and 
db/db) 


Liver 






5 


L2 9 (SEQ. ID 
NO : 2 0 ) 


Liver 
(ob/ob) 


Liver 






6 


L38 (SEQ. ID 
NO: 22, 43) 


flLiver (ob/ob 
and db/db) 


Liver 






7 


L37 (SEQ. 
NO:25, 44- 
45) 


(1 Liver 
(ob/ob) 


Liver 






8 


L57 (SEQ. ID 
NO:29, 46- 
48) 


11 Liver 

(underweight) 


Liver 






9 


Human C5 
(SEQ ID 
NO. :38) 






Heart 
(♦) 

Brain ( - ) 
Placenta 
< + ) 

Lung ( + ) 
Liver 
( + ) 

Muscle 
( + ) 

Kidney 
( + ) 

Pancreas 
( + ) 
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Gene 


Paradigm 

Expression 

Pattern 


First 
Detection 


Tissue/ 

Cell 

Dist. 


Locus 


Ref 


5 


H27 (SEQ ID 
NO: 49) 


1 Hypo t ha 1 amus 
(fasted) 


Hypothalamus 






11 




F84 (SEQ ID 
NO: 52, 54) 


8 Adipose 
(underweight) 


Adipose 






12 


10 


L34 (SEQ ID 
NO: 57) 


ft Liver 
(ob/ob) 


Liver 






13 



15 



20 



25 



1991, J. Biol. 

1993, J. 



l Mouse glutamine synthetase: Bhandari et al 
Chem. 266 : 7784-7792 . 

2 Mouse islet regenerating protein: Unno, M. et al 
Biol. Chem. .268:15974-15982. 

3 Mouse or-amylase: Schibler, U. et al . , 1986, in "Oxford 
Surveys on Eukaryotic Genes", Maclean, N . , ed . , 3:210, Oxford 
Univ. press, New York; Schibler et al . , 1982, J. Mol . Biol 
155:247-266. 

*Mouse major urinary protein II: Shahan, K. et al . , 1987, 
Mol. Cell. Biol. 2:1938-1946. 

*Mouse cytochrome C oxidase Subunit I: Raikhinstein, M. and 
Hankoglu, I . , 1993, Proc . Natl. Acad. Sci . USA 90:10509-10513- 
Bibb et al., 1981, Cell 26:167-180. — 

*Mouse testosterone 15 -or hydroxylase: Squires, K.J. and 
Negishi, M . , 1988, J. Biol. Chem. 263:4166-4171. 

7 Mouse 24p3: Flower, D.R. et al . , 1991, Biochem. Biophys . 
Res. Comm. 18£: 69-74; Hraba-Renevey , S. et al . , 1989, Oncogene 4- 
601-608. ~ ' 

•Mouse p6-5: Yamasaki, N . et al., 1987, Eur. J. Immunol 
17 :247-253 . 

9 Mouse orphan nuclear hormone receptor: Forman et al . , 
1994, Mol. Endocrinol. 8:1253-1261. 

lo The mouse C5 sequence was' first identified via sequence 
homology. C5 was then subsequently tested in ob and db mice, at 
which time it was identified to represent a differentially 
expressed gene sequence. 

"Mouse autoantigen La: Genbank Accession No. L00993 . 

"Mouse cytochrome p450 IID: Matsunaga, E. et al., 1990 J 
Mol. Evol. 30:155-169. 

"The mouse L34 gene represents the mouse homolog of the 
human lrp 13 0 gene: Hou, J. et al . , 19 94, In Vitro Cell. Dev 
Biol. Anim. 30A : 111-114 . 
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Table 2, below, lists isolated cDNA clones that 
contain genes listed in Table 1. 

TABLE 2 



GENE 


CDNA 
CLONE 


F4 9 


f amf 049a 
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human C5 



f ahs005a 



As used herein, "differentially expressed g ne" 
(i.e. target and fingerprint gene) or "pathway gene" 
5 refers to (a) a gene containing: at least one of the 
DNA sequences disclosed herein (as shown in FIGS. 3A, 
4-14, 16-17B and 20-22), or contained in the clones 
listed in Table 2, as deposited with the NRRL; (b) any 
DNA sequence that encodes the amino acid sequence 

10 encoded by: the DNA sequences disclosed herein (as 

shown in FIGS. 3A, 4-14, 16-17B and 20-22), contained 
in the clones listed in Table 2, as deposited with the 
NRRL, or contained within the coding region of the 
gene to which the DNA sequences disclosed herein (as 

15 shown in FIGS. 3A, 4-14, 16-17B and 20-22) or 
contained in the clones listed in Table 2, as 
deposited with the NRRL, belong; (c) any DNA sequence 
that hybridizes to the complement of: the coding 
sequences disclosed herein (as shown in FIGS . 3A, 4- 

20 14, 16-17B and 20-22), contained in clones listed in 
Table 2, as deposited with the NRRL, or contained 
within the coding region of the gene to which the DNA 
sequences disclosed herein (as shown in FIGS. 3A, 4- 
14, 16-17B and 20-22) or contained in the clones 

25 listed in Table 2, as deposited with the NRRL. belong, 
under highly stringent conditions, e.g. . hybridization 
to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl 
sulfate (SDS) , 1 mM EDTA at 65°, and washing in 
0.1xSSC/0.1% SDS at 68°C (Ausubel F . M . et al., eds . , 

30 1989 , Current Protocols in Molecular Biology, Vol. I, 
Green Publishing Associates, Inc., and John Wiley & 
sons, Inc., New York, at p. 2.10.3), and encodes a 
gene product functionally equivalent to a gene product 
encoded by a gene of (a) , above.; and/or (d) any DNA 

35 sequence that hybridizes to the complement of: the 
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coding sequences disclosed herein, (as shown in FIGS. 
3A, 4-14, 16-17B and 20-22) contained in the clones 
listed in Table 2, as deposited with the NRRL, or 
contained within the coding region of the gene to 
5 which DNA sequences disclosed herein (as shown in 

FIGS • 3A, 4-14, 16-17B and 20-22) or contained in the 
clones, listed in Table 2, as deposited with the NRRL, 
belong, under less stringent conditions, such as 
moderately stringent conditions, e.g. . washing in 
10 0.2xSSC/0.1% SDS at 42°C (Ausubel et al., 1989, supra ) . 
yet which still encodes a gene product, functionally 
equivalent to a gene product encoded by a gene of (a), 
above . 

The invention also includes nucleic acid 

15 molecules, preferably DNA molecules, that hybridize 
to, and are therefore the complements of, the DNA 
sequences (a) through (d) , in the preceding paragraph. 
Such hybridization conditions may be highly stringent 
or less highly stringent, as described above. In 

20 instances wherein the nucleic acid molecules are 

deoxyoligonucleotides ("oligos") , highly stringent 
conditions may refer, e.g. . to washing in 6xSSC/0.05% 
sodium pyrophosphate at 37°C (for 14-base oligos) , 48°C 
(for 17-base oligos) , 55°C (for 20-base oligos) , and 

25 60°C (for 23-base oligos) . These nucleic acid molecules 
may act as target gene antisense molecules, useful, 
for example, in target gene regulation and/or as 
antisense primers in amplification reactions of 
target, fingerprint, and/or pathway gene nucleic acid 

3 0 sequences. Further, such sequences may be used as 

part of ribozyme and/or triple helix sequences, also 
useful for target gene regulation. Still further, 
such molecules may be used as components of diagnostic 
methods whereby the presence of T or predisposition to, 

35 a body weight disorder, may be detected. 
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The invention also encompasses (a) DNA vectors 
that contain any of the foregoing coding sequences 
and/or their complements ( i.e. . antisense) / (b) DNA 
expression vectors that contain any of the foregoing 
coding sequences operatively associated with a 
regulatory element that directs the expression of the 
coding sequences; and (c) genetically engineered host 
cells that contain any of the foregoing coding 
sequences operatively associated with a regulatory 
element that directs the expression of the coding 
sequences in the host cell. As used herein, 
regulatory elements include but are not limited to 
inducible and non- inducible promoters, enhancers, 
operators and other elements known to those skilled in 
the art that drive and regulate expression. The 
invention includes fragments of any of the DNA 
sequences disclosed herein. 

In addition to the gene sequences described 
above, homologues of these gene sequences as may, for 
example, be present in other species, preferably human 
in instances wherein the above -described gene 
sequences are not human gene sequences, may be 
identified and isolated, without undue 
experimentation, by molecular biological techniques 
25 well known in the art. Further, there may exist genes 
at other genetic loci within the genome that encode 
proteins which have extensive homology to one or more 
domains of such gene products. These genes may also 
be identified via similar techniques. 
30 For example, an isolated differentially expressed 

gene sequence may be labeled and used to screen a cDNA 
library constructed from mRNA obtained from the 
organism of interest. Hybridization conditions will 
be of a lower stringency when the cDNA library was 
35 derived from an organism different from the type of 



20 



- 45 - 



organism from which the labeled sequence was derived. 
Alternatively, the labeled fragment may be used to 
screen a genomic library derived from the organism of 
interest, again, using appropriately stringent 
5 conditions. Such low stringency conditions will be 

well known to those of skill in the art, and will vary 
predictably depending on the specific organisms from 
which the library and the labeled sequences are 
derived. For guidance regarding such conditions see, 

10 for example, Sambrook et al . , 1989, Molecular Cloning, 
A Laboratory Manual, Cold Springs Harbor Press, N.Y.; 
and Ausubel et al . , 1989, Current Protocols in 
Molecular Biology, (Green Publishing Associates and 
Wiley Interscience , N.Y.). 

15 Further, a previously unknown differentially 

expressed or pathway gene-type sequence may be 
isolated by performing PCR using two degenerate 
oligonucleotide primer pools designed on the basis of 
amino acid sequences within the gene of interest . The 

20 template for the reaction may be cDNA obtained by 

reverse transcription of mRNA prepared fror. human or 
non-human cell lines or tissue known cr suspr rt «• t: 
express a differentially expressed cr ptnway aer." 
allele. The PCR product may be subr 1 : r." i ar.: 

25 sequenced to insure that the amplifir : 

represent the sequences of a di f f ereni . a L 1 y ' x; :» £;sed 
or pathway gene-like nucleic acid sequence. 

The PCR fragment may then be used to isolate a 
full length cDNA clone by a variety of methods. For 

30 example, the amplified fragment may be used to screen 
a bacteriophage cDNA library. Alternatively, the 
labeled fragment may be used to screen a genomic 
library . 

PCR technology may also be utilized to isolate 
35 full length cDNA sequences. For example, RNA may be 
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isolated, following standard procedures, from an 
appropriate cellular or tissue source. A reverse 
transcription reaction may be performed on the RNA 
using an oligonucleotide primer specific for the most 
5 5' end of the amplified fragment for the priming of 

first strand synthesis. The resulting RNA/DNA hybrid 
may then be "tailed" with guanines using a standard 
termmal transferase reaction, the hybrid may be 
digested with RNAase H, and second strand synthesis 
10 may then be primed with a poly-c primer. Thus, cDNA 
sequences upstream of the amplified fragment may 
easily be isolated. For a review of cloning 
strategies which may be used, see e.g., Sambrook et 
al., 1989, Molecular Cloning, A Laboratory Manual 
15 Cold Springs Harbor Press, N.Y.; and Ausubel et al . , 
198 9, Current Protocols in Molecular Biology, (Green 
Publishing Associates and Wiley Interscience, N.Y.) 

In cases where the differentially expressed or 
pathway gene identified is the normal, or wild type 
20 gene, this gene may be used to isolate mutant alleles 
of the gene. Such an isolation is preferable in 
processes and disorders which are known or suspected 
to have a genetic basis. Mutant alleles may be 
isolated from individuals either known or suspected to 
have a genotype which contributes to body weight 
disorder symptoms. Mutant alleles and mutant allele 
products may then be utilized in the therapeutic and 
diagnostic assay systems described below. 

A cDNA of the mutant gene may be isolated, for 
example, by using PCR, a technique which is well known 
to those of skill in the art. m this case, the first 
cDNA strand may be synthesized by hybridizing a oligo- 
dT oligonucleotide to mRNA isolated from tissue known 
to, or suspected of, being expressed in an individual 
putatively carrying the mutant allele, and by 
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extending the new strand with reverse transcriptase. 
The second strand of the cDNA is then synthesized 
using an oligonucleotide that hybridizes specifically 
to the 5'- end of the normal gene. Using these two 
5 primers, the product is then amplified via PCR, cloned 
into a suitable vector, and subjected to DNA sequence 
analysis through methods well known to those of skill 
in the art . By comparing the DNA sequence of the 
mutant gene to that of the normal gene, the 

10 mutation (s) responsible for the loss or alteration of 
function of the mutant gene product can be 
ascertained. 

Alternatively, a genomic or cDNA library can be 
constructed and screened using DNA or RNA, 

15 respectively, from a tissue known to or suspected of 
expressing the gene of interest in an individual 
suspected of or known to carry the mutant allele. The 
normal gene or any suitable fragment thereof may then 
be labeled and used as a probe to identify the 

20 corresponding mutant allele in the library. The clone 
containing this gene may then be purified through 
methods routinely practiced in the art, and subjected 
to sequence analysis as described, above, in this 
Section . 

25 Additionally, an expression library can be 

constructed utilizing DNA isolated from or cDNA 
synthesized from a tissue known to or suspected of 
expressing the gene of interest in an individual 
suspected of or known to carry the mutant allele. In 

30 this manner, gene products made by the putatively 
mutant tissue may be expressed and screened using 
standard antibody screening techniques in conjunction 
with antibodies raised against the normal gene 
product, as described, below, in Section 5.2.3. (For 

35 screening techniques, see, for example, Harlow, E. and 
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Lane, eds,, 1988, "Antibodies: A Laboratory Manual", 
Cold Spring Harbor Press, Cold Spring Harbor.) In 
cases where the mutation results in an expressed gene 
product with altered function ( e.g. , as a result of a 
missense mutation) , a polyclonal set of antibodies are 
likely to cross-react with the mutant gene product. 
Library clones detected via their reaction with such 
labeled antibodies can be purified and subjected to 
sequence analysis as described in this Section, above. 



5.2.2. DIFFERENTIALLY EXPRESSED AND 
PATHWAY GENE PRODUCTS 

Differentially expressed and pathway gene 

products include those proteins encoded by the 

differentially expressed and pathway gene sequences 

described in Section 5.2.1, above, as for example, the 

peptides listed in FIGS. 14 (SEQ ID NO: 34), 16 (SEQ 

ID NO: 36), 17A-17B (SEQ. ID NO. : 56) and 18 (SEQ ID 

NO: 51). 

In addition, differentially expressed and pathway 
gene products may include proteins that represent 
functionally equivalent gene products. Such an 
equivalent differentially expressed or pathway gene 
product may contain deletions, additions or 
substitutions of amino acid residues within the amino 
acid sequence encoded by the differentially expressed 
or pathway gene sequences described, above, in Section 
5.2.1, but which result in a silent change, thus 
producing a functionally equivalent differentially 
expressed or pathway gene product. Amino acid 
substitutions may be made on the basis of similarity 
in polarity, charge, solubility, hydrophobic ity, 
hydrophilicity, and/or the amphipathic nature of the 
residues involved. For example, nonpolar 
(hydrophobic) amino acids include alanine, leucine, 
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isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine; polar neutral amino acids 
include glycine, serine, threonine, cysteine, 
tyrosine, asparagine, and glutamine; positively 
5 charged (basic) amino acids include arginine, lysine, 
and histidine; and negatively charged (acidic) amino 
acids include aspartic acid and glutamic acid. 
"Functionally equivalent", as utilized herein, refers 
to either a protein capable of exhibiting a 

10 substantially similar in vivo activity as the 

endogenous differentially expressed or pathway gene 
products encoded by the differentially expressed or 
pathway gene sequences described in Section 5.2.1, 
above. Alternatively, when utilized as part of assays 

15 such as those described, below, in Section 5.3, 
"functionally equivalent" may refer to peptides 
capable of interacting with other cellular or 
extracellular molecules in a manner substantially 
similar to the way in which the corresponding portion 

20 of the endogenous differentially expressed or pathway 
gene product would. 

The differentially expressed or pathway gene 
products may be produced by recombinant DNA technology 
using techniques well known in the art. Thus, methods 

25 for preparing the differentially expressed or pathway 
gene polypeptides and peptides of the invention by 
expressing nucleic acid encoding differentially 
expressed or pathway gene sequences are described 
herein. Methods which are well known to those skilled 

30 in the art can be used to construct expression vectors 
containing differentially expressed or pathway gene 
protein coding sequences and appropriate 
transcriptional/translat ional control signals . These 
methods include, for example, in vitro recombinant DNA 

35 techniques, synthetic techniques and in vivo 
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recombination/genetic recombination. See, for 
example, the techniques described in Sambrook et al . , 
1989, Molecular Cloning A Laboratory Manual, Cold 
Spring Harbor Laboratory, N.Y. which is incorporated 
by reference herein in their entirety, and Ausubel, 
198 9, supra . Alternatively, RNA capable of encoding 
differentially expressed or pathway gene protein 
sequences may be chemically synthesized using, for 
example, synthesizers. See, for example, the 
techniques described in "Oligonucleotide Synthesis", 
1984, Gait, M.J. ed. , IRL Press, Oxford, which is 
incorporated by reference herein in its entirety. 

A variety of host -expression vector systems may 
be utilized to express the differentially expressed or 
pathway gene coding sequences of the invention. Such 
host -expression systems represent vehicles by which 
the coding sequences of interest may be produced and 
subsequently purified, but also represent cells which 
may, when transformed or transfected with the 
appropriate nucleotide coding sequences, exhibit the 
differentially expressed or pathway gene protein of 
the invention in situ. These include but are not 
limited to microorganisms such as bacteria ( e.g. . 
SSii. B. subtil is ) transformed with recombinant " 
bacteriophage DNA, plasmid DNA or cosmid DNA expres- 
sion vectors containing differentially expressed or 
pathway gene protein coding sequences; yeast ( e.g. 
Saccharomyces , Pichia ) transformed with recombinant 
yeast expression vectors containing the differentially 
expressed or pathway gene protein coding sequences; 
insect cell systems infected with recombinant virus 
expression vectors (e^., baculovirus) containing the 
differentially expressed or pathway gene protein 
coding sequences; plant cell systems infected with 
35 recombinant virus expression vectors (e.g. . 
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cauliflower mosaic virus, CaMV; tobacco mosaic virus, 
TMV) or transformed with recombinant plasmid expres- 
sion vectors ( e.g. , Ti plasmid) containing 
differentially expressed or pathway gene protein 
5 coding sequences; or mammalian cell systems ( e.g. COS, 
CHO, BHK, 2 93, 3T3) harboring recombinant expression 
constructs containing promoters derived from the 
genome of mammalian cells ( e.g. . metallothionein 
promoter) or from mammalian viruses ( e.g. , the 
10 adenovirus late promoter; the vaccinia virus 7.5K 
promoter) . 

In bacterial systems, a number of expression 
vectors may be advantageously selected depending upon 
the use intended for the differentially expressed or 

15 pathway gene protein being expressed. For example, 
when a large quantity of such a protein is to be 
produced, for the generation of antibodies or to 
screen peptide libraries, for example, vectors which 
direct the expression of high levels of fusion protein 

20 products that are readily purified may be desirable. 
Such vectors include, but are not limited, to the E . 
coli expression vector pUR278 (Ruther et al . , 1983, 
EMBO J. 2:1791), in which the differentially expressed 
or pathway gene protein coding sequence may be ligated 

25 individually into the vector in frame with the lacZ 
coding region so that a fusion protein is produced; 
pIN vectors (Inouye & Inouye, 1985, Nucleic Acids Res. 
13.:3101-3109; Van Heeke & Schuster, 1989, J. Biol. 
Chem. 264:5503-5509); and the like. pGEX vectors may 

30 also be used to express foreign polypeptides as fusion 
proteins with glutathione S- transferase (GST) . In 
general, such fusion proteins, are soluble and can 
easily be purified from lysed cells by adsorption to 
glutathione -agarose beads followed by elution in the 

35 presence of free glutathione. The pGEX vectors are 
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designed to include thrombin or factor Xa protease 
cleavage sites so that the cloned Target gene protein 
can be released from the GST moiety. 

In an insect system, Autographa californica 
nuclear polyhedrosis virus (AcNPV) is used as a vector 
to express foreign genes. The virus grows in 
Spodoptera frugiperda cells. The differentially 
expressed or pathway gene coding sequence may be 
cloned individually into non-essential regions (for 
example the polyhedrin gene) of the virus and placed 
under control of an AcNPV promoter (for example the 
polyhedrin promoter) . Successful insertion of 
differentially expressed or pathway gene coding 
sequence will result in inactivation of the polyhedrin 
15 gene and production of non-occluded recombinant virus 
( i - e • ' virus lacking the proteinaceous coat coded for 
by the polyhedrin gene) . These recombinant viruses 
are then used to infect Spodoptera frugiperda cells in 
which the inserted gene is expressed. (E.g., see 
20 Smith et al . , 1983, J. Viol. 46:584; Smith, U.S. 
Patent No. 4,215,051). 

In mammalian host cells, a number of viral -based 
expression systems may be utilized. In cases where an 
adenovirus is used as an expression vector, the 
25 differentially expressed or pathway gene coding 

sequence of interest may be ligated to an adenovirus 
transcription/translation control complex, e.g. , the 
late promoter and tripartite leader sequence. This 
chimeric gene may then be inserted in the adenovirus 
30 genome by in vitro or in vivo recombination. Inser- 
tion in a non-essential region of the viral genome 
l e '9w region El or E3) will result in a recombinant 
virus that is viable and capable of expressing 
differentially expressed or pathway gene protein in 
35 infected hosts. ( e.g. . See Logan & Shenk, 1984, Proc . 
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Natl. Acad. Sci . USA 81 : 3655-3659) . Specific 
initiation signals may also be required for efficient 
translation of inserted differentially expressed or 
pathway gene coding sequences. These signals include 
5 the ATG initiation codon and adjacent sequences . In 
cases where an entire differentially expressed or 
pathway gene, including its own initiation codon and 
adjacent sequences, is inserted into the appropriate 
expression vector, no additional translat ional control 

10 signals may be needed. However, in cases where only a 
portion of the differentially expressed or pathway 
gene coding sequence is inserted, exogenous 
translational control signals, including, perhaps, the 
ATG initiation codon, must be provided. Furthermore, 

15 the initiation codon must be in phase with the reading 
frame of the desired coding sequence to ensure 
translation of the entire insert. These exogenous 
translational control signals and initiation codons 
can be of a variety of origins, both natural and 

20 synthetic. The efficiency of expression may be 

enhanced by the inclusion of appropriate transcription 
enhancer elements, transcription terminators, etc. 
(see Bittner et al . , 1987, Methods in Enzymol . 
153 : 516-544) . 

25 In addition, a host cell strain may be chosen 

which modulates the expression of the inserted 
sequences, or modifies and processes the gene product 
in the specific fashion desired. Such modifications 
( e.g. , glycosylation) and processing ( e.g. , cleavage) 

3 0 of protein products may be important for the function 
of the protein. Different host cells have charac- 
teristic and specific mechanisms for the post - transla- 
tional processing and modification of proteins . 
Appropriate cell lines or host systems can be chosen 

35 to ensure the correct modification and processing of 
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the foreign protein expressed. To this end, 
eukaryotic host cells which possess the cellular 
machinery for proper processing of the primary 
transcript, glycosylation, and phosphorylation of the 
gene product may be used. Such mammalian host cells 
include but are not limited to CHO, VERO, BHK, HeLa, 
COS, MDCK, 293, 3T3 , WI38, etc. 

For long-term, high-yield production of recombi- 
nant proteins, stable expression is preferred. For 
example, cell lines which stably express the 
differentially expressed or pathway gene protein may 
be engineered. Rather than using expression vectors 
which contain viral origins of replication, host cells 
can be transformed with DNA controlled by appropriate 
15 expression control elements ( e.g. . promoter, enhancer, 
sequences, transcription terminators, polyadenylat ion 
sites, etc.), and a selectable marker. Following the 
introduction of the foreign DNA, engineered cells may 
be allowed to grow for 1-2 days in an enriched media, 
20 and then are switched to a selective media. The 

selectable marker in the recombinant plasmid confers 
resistance to the selection and allows cells to stably 
integrate the plasmid into their chromosomes and grow 
to form foci which in turn can be cloned and expanded 
25 into cell lines. This method may advantageously be 
used to engineer cell lines which express the 
differentially expressed or pathway gene protein. 
Such engineered cell lines may be particularly useful 
in screening and evaluation of compounds that affect 
30 the endogenous activity of the differentially 
expressed or pathway gene protein. 

A number of selection systems may be used, 
including but not limited to the herpes simplex virus 
thymidine kinase (Wigler, et al . , 1977, Cell 11:223), 
35 hypoxanthine -guanine phosphor ibosyl trans f erase 



interest. Such regulatory regions will be well known 
to those of skill in the art, and may be utilized in 
the absence of undue experimentation. 

For underexpression of an endogenous target gene 
sequence, such a sequence may be isolated and 
engineered such that when reintroduced into the genome 
of the animal of interest, the endogenous target gene 
alleles will be inactivated. Preferably, the 
engineered target gene sequence is introduced via gene 
targeting such that the endogenous target sequence is 
disrupted upon integration of the engineered target 
gene sequence into the animal ' s genome . Gene 
targeting is discussed, below, in this Section. 

Animals of any species, including, but not 
limited to, mice, rats, rabbits, guinea pigs, pigs, 
micro-pigs, goats, and non-human primates, e.g. , 
baboons, squirrels, monkeys, and chimpanzees may be 
used to generate body weight disorder animal models. 

Any technique known in the art may be used to 
introduce a target gene transgene into animals to 
produce the founder lines of transgenic animals. Such 
techniques include, but are not limited to pronuclear 
microinjection (Hoppe, P.C. and Wagner, T.E., 1989, 
U.S. Pat. No. 4,873,191); retrovirus mediated gene 
transfer into germ lines (Van der Putten et al . , 1985, 
Proc. Natl. Acad. Sci., USA 82 : 6148-6152) ; gene 
targeting in embryonic stem cells (Thompson et al . . 
1989, Cell 56.: 313-321) ; electroporat ion of embryos 
(Lo, 1983, Mol Cell. Biol. 2:1803-1814); and sperm- 
mediated gene transfer (Lavitrano et al . , 1989, Cell 
57^:717-723); etc. For a review of such techniques, 
see Gordon, 1989, Transgenic Animals, Intl. Rev.. 
Cytol . 115 : 171-229 , which is incorporated by reference 
herein in its entirety. 
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The present invention provides for transgenic 
animals that carry the transgene in all their cells, 
as well as animals which carry the transgene in some, 
but not all their cells, i.e. , mosaic animals. (See, 
for example, techniques described by Jakobovits, 1994, 
Curr. Biol. 4: 761-763). The transgene may be 
integrated as a single transgene or in concatamers , 
g i g u» head-to-head tandems or head-to-tail tandems. 
The transgene may also be selectively introduced into 
and activated in a particular cell type by following, 
for example, the teaching of Lasko et al . (Lasko, M. 
et al., 1992, Proc. Natl. Acad. Sci. USA 89:6232- 
6236). The regulatory sequences required for such a 
cell-type specific activation will depend upon the 
15 particular cell type of interest, and will be apparent 
to those of skill in the art. 

When it is desired that the target gene transgene 
be integrated into the chromosomal site of the 
endogenous target gene, gene targeting is preferred. 
20 Briefly, when such a technique is to be utilized, 
vectors containing some nucleotide sequences 
homologous to the endogenous target gene of interest 
are designed for the purpose of integrating, via 
homologous recombination with chromosomal sequences, 
25 into and disrupting the function of, the nucleotide 

sequence of the endogenous target gene. The transgene 
may also be selectively introduced into a particular 
cell type, thus inactivating the endogenous gene of 
interest in only that cell type, by following, for 
30 example, the teaching of Gu et al . (Gu, H. et al. , 

1994, Science 26.5:103-106). The regulatory sequences 
required for such a cell-type specific inactivation 
will depend upon the particular cell type of interest, 
and will be apparent to those of skill in the art. 



35 
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Once transgenic animals have been generated, the 
expression of the recombinant target gene and protein 
may be assayed utilizing standard techniques. Initial 
screening may be accomplished by Southern blot 
analysis or PCR techniques to analyze animal tissues 
to assay whether integration of the transgene has 
taken place. The level of mRNA expression of the 
transgene in the tissues of the transgenic animals may 
also be assessed using techniques which include but 
are not limited to Northern blot analysis of tissue 
samples obtained from the animal, in situ 
hybridization analysis, and RT-PCR. Samples of target 
gene-expressing tissue, may also be evaluated 
immunocytochemically using antibodies specific for the 
target gene transgene gene product of interest. 

The target gene transgenic animals that express 
target gene mRNA or target gene transgene peptide 
(detected immunocytochemically, using antibodies 
directed against target gene product epitopes) at 
easily detectable levels should then be further 
evaluated to identify those animals which display 
characteristic body weight disorder-like symptoms. 
Such symptoms may include, for example, obesity, 
anorexia, and an abnormal food intake. Additionally, 
specific cell types within the transgenic animals may 
be analyzed and assayed for cellular phenotypes 
characteristic of body weight disorders. Such 
cellular phenotypes may include, for example, abnormal 
adipocyte differentiation ( e.g. , abnormal 
preadipocyte/adipoctye differentiation) and 
metabolism. Further, such cellular phenotypes may 
include as assessment of a particular cell types 
fingerprint pattern of expression and its comparison 
to known fingerprint expression profiles of the 
particular cell type in animals exhibiting body weight 
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disorders. Such transgenic animals serve as suitable 
model systems for body weight disorders. 

Once target gene transgenic founder animals are 
produced ( i.e. , those animals which express target 
5 gene proteins in cells or tissues of interest, and 
which, preferably, exhibit symptoms of body weight 
disorders) , they may be bred, inbred, outbred, or 
crossbred to produce colonies of the particular 
animal. Examples of such breeding strategies include 

10 but are not limited to: outbreeding of founder 

animals with more than one integration site in order 
to establish separate lines; inbreeding of separate 
lines in order to produce compound target gene 
transgenics that express the target gene transgene of 

15 interest at higher levels because of the effects of 
additive expression of each target gene transgene; 
crossing of heterozygous transgenic animals to produce 
animals homozygous for a given integration site in 
order to both augment expression and eliminate the 

20 possible need for screening of animals by DNA 

analysis; crossing of separate homozygous lines to 
produce compound heterozygous or homozygous lines; 
breeding animals to different inbred genetic 
backgrounds so as to examine effects of modifying 

25 alleles on expression of the target gene transgene and 
the development of body weight disorder- like symptoms. 
One such approach is to cross the target gene 
transgenic founder animals with a wild type strain to 
produce an Fl generation that exhibits body weight 

30 disorder- like symptoms, such as obesity, anorexia, and 
abnormal food intake. The Fl generation may then be 
inbred in order to develop a homozygous line, if it is 
found that homozygous target gene transgenic animals 
are viable. 
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5.2.4.2. CELL-BASED ASSAYS 
Cells that contain and express target gene 
sequences which encode target gene protein, and, 
further, exhibit cellular phenotypes associated with a 
5 body weight disorder of interest, may be utilized to 
identify compounds that exhibit an ability to 
ameliorate body weight disorder symptoms. Cellular 
phenotypes which may indicate an ability to ameliorate 
body weight disorders may include, for example, 

10 inhibition of adipose cell differentiation ( e.g. , an 
inhibition of differentiation of preadipocytes into 
adipocytes) and an inhibition of the ability of 
adipocytes to synthesize fat. 

Further, the fingerprint pattern of gene 

15 expression of cells of interest may be analyzed and 
compared to the normal, non-body weight disorder 
fingerprint pattern. Those compounds which cause 
cells exhibiting body weight disorder-like cellular 
phenotypes to produce a fingerprint pattern more 

20 closely resembling a normal fingerprint pattern for 

the cell of interest may be considered candidates for 
further testing regarding an ability to ameliorate 
body weight disorder symptoms. 

Cells which be utilized for such assays may, for 

25 example, include non- recombinant cell lines, such as 
preadipocyte cell lines such as 3T3-L1 and TA1 mouse 
preadipocyte cell lines, liver cell lines, such as the 
Hepal-6 mouse liver cell line, and the HepG2 human 
liver cell line. 

30 Further, cells which may be used for such assays 

may also include recombinant, transgenic cell lines. 
For example, the body weight disorder animal models of 
the invention, discussed, above, in Section 5.2.4.1, 
may be used to generate cell lines, containing one or 

35 more cell types involved in body weight disorders, 
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that can be used as cell culture models for this 
disorder. While primary cultures derived from the 
body weight disorder transgenic animals of the 
invention may be utilized, the generation of 
5 continuous cell lines is preferred. For examples of 
techniques which may be used to derive a continuous 
cell line from the transgenic animals, see Small et 
3l ■ , 1985, Mol. Cell Biol. 5:642-648. 

Alternatively, cells of a cell type known to be 

10 involved in body weight disorders may be transfected 

with sequences capable of increasing or decreasing the 
amount of target gene expression within the cell. For 
example, target gene sequences may be introduced into, 
and overexpressed in, the genome of the cell, of 

15 interest, or, if endogenous target gene sequences are 
present, they may either be overexpressed or, 
alternatively, be disrupted in order to underexpress 
or inactivate target gene expression. 

In order to overexpress a target gene sequence, 

2 0 the coding portion of the target gene sequence may be 
ligated to a regulatory sequence which is capable of 
driving gene expression in the cell type of interest. 
Such regulatory regions will be well known to those of 
skill in the art, and may be utilized in the absence 

25 of undue experimentation. 

For underexpression of an endogenous target gene 
sequence, such a sequence may be isolated and 
engineered such that when reintroduced into the genome 
of the cell type of interest, the endogenous target 

30 gene alleles will be inactivated. Preferably, the 

engineered target gene sequence is introduced via gene 
targeting such that the endogenous target sequence is 
disrupted upon integration of the engineered target 
gene sequence into the cell's genome. Gene targeting 

35 is discussed, above, in Section 5.4.2.1. 
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Transfection of target gene sequence nucleic acid 
may be accomplished by utilizing standard techniques. 
See, for example, Ausubel, 1989, supra . Transfected 
cells should be evaluated for the presence of the 
recombinant target gene sequences, for expression and 
accumulation of target gene mRNA, and for the presence 
of recombinant target gene protein production. In 
instances wherein a decrease in target gene expression 
is desired, standard techniques may be used to 
demonstrate whether a decrease in endogenous target 
gene expression and/or in target gene product 
production is achieved. 

5.3. SCREENING ASSAYS FOR COMPOUNDS 
THAT INTERACT WITH THE TARGET 
GENE PRODUCT 

The following assays are designed to identify 
compounds that bind to target gene products, bind to 
other cellular proteins that interact with a target 
gene product, and to compounds that interfere with the 
interaction of the target gene product with other 
cellular proteins. Such compounds may include, but 
are not limited to, other cellular proteins. Methods 
for the identification of such cellular proteins are 
described, below, in Section 5.3.2. 

Compounds may include, but are not limited to, 
peptides such as, for example, soluble peptides, 
including but not limited to, Ig-tailed fusion 
peptides, comprising extracellular portions of target 
gene product transmembrane receptors, and members of 
random peptide libraries; (see, e.g. . Lam, K.S. et 
al . , 1991, Nature 354 : 82-84 ; Houghten, R. et al . , 
1991, Nature 354:84-86), made of D- and/or L- 
conf iguration amino acids, phosphopeptides (including, 
but not limited to members of random or partially 
degenerate, directed phosphopeptide libraries; see, 
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e .g. , Songyang, Z. et al . , 1993, Cell 72:767-778), 
antibodies (including, but not limited to, polyclonal, 
monoclonal, humanized, anti - idiotypic , chimeric or 
single chain antibodies, and FAb, F(ab') 2 and FAb 
5 expression library fragments, and epitope -binding 
fragments thereof) , and small organic or inorganic 
molecules . 

Compounds identified via assays such as those 
described herein may be useful, for example, in 

10 elaborating the biological function of the target gene 
product, and for ameliorating body weight disorders. 
In instances, for example, whereby a body weight 
disorder situation results from a lower overall level 
of target gene expression, target gene product, and/or 

15 target gene product activity in a cell or tissue 

involved in such a body weight disorder, compounds 
that interact with the target gene product may include 
ones which accentuate or amplify the activity of the 
bound target gene protein. Such compounds would bring 

20 about an effective increase in the level of target 
gene activity, thus ameliorating symptoms . In 
instances whereby mutations within the target gene 
cause aberrant target gene proteins to be made which 
have a deleterious effect that leads to a body weight 

25 disorder, compounds that bind target gene protein may 
be identified that inhibit the activity of the bound 
target gene protein. Assays for testing the 
effectiveness of compounds, identified by, for 
example, techniques such as those described in Section 

30 5.3.1-5.3.3, are discussed, below, in Section 5.3.4. 



5.3.1. IN VITRO SCREENING ASSAYS FOR 
COMPOUNDS THAT BIND TO THE 
TARGET GENE PRODUCT 

In vitro systems may be designed to identify 

compounds capable of binding the target gene products 
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of the invention. Compounds identified may be useful, 
for example, in modulating the activity of wild type 
and/or mutant target gene products, may be useful in 
elaborating the biological function of the target gene 
5 product, may be utilized in screens for identifying 
compounds that disrupt normal target gene product 
interactions, or may in themselves disrupt such 
interactions . 

The principle of the assays used to identify 

10 compounds that bind to the target gene product 

involves preparing a reaction mixture of the target 
gene product and the test compound under conditions 
and for a time sufficient to allow the two components 
to interact and bind, thus forming a complex which can 

15 be removed and/or detected in the reaction mixture. 
These assays can be conducted in a variety of ways. 
For example, one method to conduct such an assay would 
involve anchoring target gene product or the test 
substance onto a solid phase and detecting target gene 

20 product/test compound complexes anchored on the solid 
phase at the end of the reaction. In one embodiment 
of such a method, the target gene product may be 
anchored onto a solid surface, and the test compound, 
which is not anchored, may be labeled, either directly 

25 or indirectly. 

In practice, microtiter plates may conveniently 
be utilized as the solid phase. The anchored 
component may be immobilized by non-covalent or 
covalent attachments. Non-covalent attachment may be 

30 accomplished by simply coating the solid surface with 
a solution of the protein and drying. Alternatively, 
an immobilized antibody, preferably a monoclonal 
antibody, specific for the protein to be immobilized 
may be used to anchor the protein to the solid 
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surface. The surfaces may be prepared in advance and 
stored. 

In order to conduct the assay, the nonimmobilized 
component is added to the coated surface containing 
the anchored component. After the reaction is 
complete, unreacted components are removed ( e.g. . by 
washing) under conditions such that any complexes 
formed will remain immobilized on the solid surface. 
The detection of complexes anchored on the solid 
surface can be accomplished in a number of ways. 
Where the previously nonimmobilized component is pre- 
_ labeled, the detection of label immobilized on the 
surface indicates that complexes were formed. Where 
the previously nonimmobilized component is not: pre- 
15 labeled, an indirect label can be used to detect 
complexes anchored on the surface; e.g. . using a 
labeled antibody specific for the previously 
nonimmobilized component (the antibody, in turn, may 
be directly labeled or indirectly labeled with a 
20 labeled anti-Ig antibody) . 

Alternatively, a reaction can be conducted ir. a 
liquid phase, the reaction products F^p.i: .i : : i : ~~ 

unreacted components, and complexes cic*: j_ . 

using an immobilized antibody specif. t : :.irj- # ;*r.<* 
25 product or the test compound to anch: : u- . X es 
formed in solution, and a labeled ar.t.: r/ r^^v.i.z 
for the other component of the possible corr.piex to 
detect anchored complexes . 

30 5.3.2. ASSAYS FOR CELLULAR PROTEINS 

THAT INTERACT WITH THE TARGET GENE 
PROTEIN 

Any method suitable for detecting protein-protein 

interactions may be employed for identifying novel 

target protein-cellular or extracellular protein 

35 

interactions. These methods are outlined in section 
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5.1.2., above, for the identification of pathway- 
genes, and may be utilized herein with respect to the 
identification of proteins which interact with 
identified target proteins. 
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5.3.3. ASSAYS FOR COMPOUNDS THAT INTERFERE 
WITH TARGET GENE PRODUCT / CELLULAR 
MACROMOLECULE INTERACTION 

The target gene products of the invention may, in 
vivo, interact with one or more cellular or 
extracellular macromolecules , such as proteins. Such 
macromolecules may include, but are not limited to, 
nucleic acid molecules and those proteins identified 
via methods such as those described, above, in Section 
5.3.2. For purposes of this discussion, such cellular 
and extracellular macromolecules are referred to 
herein as "binding partners'*. Compounds that disrupt 
such interactions may be useful in regulating the 
activity of the target gene product, especially mutant 
target gene products. Such compounds may include, but 
are not limited to molecules such as antibodies, 
peptides, and the like, as described, for example, in 
Section 5.3.1. above. 

The basic principle of the assay systems used to 
identify compounds that interfere with the interaction 
between the target gene product and its cellular or 
extracellular binding partner or partners involves 
preparing a reaction mixture containing the target 
gene product, and the binding partner under conditions 
and for a time sufficient to allow the two to interact 
and bind, thus forming a complex. In order to test a 
compound for inhibitory activity, the reaction mixture 
is prepared in the presence and absence of the test 
compound. The test compound may be initially included 
in the reaction mixture, or may be added at a time 
subsequent to the addition of target gene product and 
its cellular or extracellular binding partner. 
Control reaction mixtures are incubated without the 
test compound or with a placebo. The formation of any 
complexes between the target gene protein and the 
cellular or extracellular binding partner is then 
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detected. The formation of a complex in the control 
reaction, but not in the reaction mixture containing 
the test compound, indicates that the compound 
interferes with the interaction of the target gene 
5 protein and the interactive binding partner. 

Additionally, complex formation within reaction 
mixtures containing the test compound and normal 
target gene protein may also be compared to complex 
formation within reaction mixtures containing the test 
10 compound and a mutant target gene protein. This 

comparison may be important in those cases wherein it 
is desirable* to identify compounds that disrupt 
interactions of mutant but not normal target gene 
proteins . 

15 The assay for compounds that interfere with the 

interaction of the target gene products and binding 
partners can be conducted in a heterogeneous or 
homogeneous format. Heterogeneous assays involve 
anchoring either the target gene product or the 

20 binding partner onto a solid phase and detecting 

complexes anchored on the solid phase at the end of 
the reaction. In homogeneous assays. th«* entire 
reaction is carried out in a liquid pr.aFf I r. citner 
approach, the order of addition of : ••art ar.tr cur. t*- 

25 varied to obtain different informat ; : : t 

compounds being tested. For example. t*-r: r-— j. .;nds 
that interfere with the interaction between tr^ target 
gene products and the binding partners, e.g. . by 
competition, can be identified by conducting the 

30 reaction in the presence of the test substance; i.e. . 
by adding the test substance to the reaction mixture 
prior to or simultaneously with the target gene 
protein and interactive cellular or extracellular 
binding partner. Alternatively, test compounds that 

3 5 disrupt preformed complexes, e.g. compounds with 
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higher binding constants that displace one of the 
components from the complex, can be tested by adding 
the test compound to the reaction mixture after 
complexes have been formed. The various formats are 
5 described briefly below. 

In a heterogeneous assay system, either the 
target gene protein or the interactive cellular or 
extracellular binding partner, is anchored onto a 
solid surface, while the non-anchored species is 

10 labeled, either directly or indirectly. In practice, 
microtiter plates are conveniently utilized. The 
anchored species may be immobilized by non-covalent or 
covalent attachments. Non-covalent attachment may be 
accomplished simply by coating the solid surface with 

15 a solution of the target gene product or binding 

partner and drying. Alternatively, an immobilized 
antibody specific for the species to be anchored may 
be used to anchor the species to the solid surface. 
The surfaces may be prepared in advance and stored. 

20 In order to conduct the assay, the partner of the 

immobilized species is exposed to the coated surface 
with or without the test compound. After the reaction 
is complete, unreacted components are removed ( e.g. . 
by washing) and any complexes formed will remain 

25 immobilized on the solid surface. The detection of 
complexes anchored on the solid surface can be 
accomplished in a number of ways. Where the non- 
immobilized species is pre-labeled, the detection of 
label immobilized on the surface indicates that 

3 0 complexes were formed. Where the non- immobilized 

species is not pre-labeled, an indirect label can be 
used to detect complexes anchored on the surface; 
e » q . . using a labeled antibody specific for the 
initially non-immobilized species (the antibody, in 

35 turn, may be directly labeled or indirectly labeled 
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with a labeled anti-Ig antibody) . Depending upon the 
order of addition of reaction components , test 
compounds which inhibit complex formation or which 
disrupt preformed complexes can be detected. 
5 Alternatively, the reaction can be conducted in a 

liquid phase in the presence or absence of the test 
compound, the reaction products separated from 
unreacted components, and complexes detected; e.g. . 
using an immobilized antibody specific for one of the 

10 binding components to anchor any complexes formed in 
solution, and a labeled antibody specific for the 
other partner to detect anchored complexes. Again, 
depending upon the order of addition of reactants to 
the liquid phase, test compounds which inhibit complex 

15 or which disrupt preformed complexes can be 
identified. 

In an alternate embodiment of the invention, a 
homogeneous assay can be used. In this approach, a 
preformed complex of the target gene protein and the 

20 interactive cellular or extracellular binding partner 
is prepared in which either the target gene product or 
its binding partners is labeled, but the signal 
generated by the label is quenched due to complex 
formation (see, e.g. , U.S. Patent No. 4,109,496 by 

25 Rubenstein which utilizes this approach for 

immunoassays) . The addition of a test substance that 
competes with and displaces one of the species from 
the preformed complex will result in the generation of 
a signal above background. In this way, test 

30 substances which disrupt target gene protein/cellular 
or extracellular binding partner interaction can be 
identified. 

In a particular embodiment, the target gene 
product can be prepared for immobilization using 
35 recombinant DNA techniques described in Section 5.2.1, 
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above. For example, the target gene coding region can 
be fused to a glutathione-S-transf erase (GST) gene 
using a fusion vector, such as pGEX-5X-l, in such a 
manner that its binding activity is maintained in the 
5 resulting fusion protein. The interactive cellular or 
extracellular binding partner can be purified and used 
to raise a monoclonal antibody, using methods 
routinely practiced in the art and described above, in 
Section 5.2.3. This antibody can be labeled with the 

10 radioactive isotope 125 I, for example, by methods 

routinely practiced in the art. In a heterogeneous 
assay, e.g. , the GST-target gene fusion protein can be 
anchored to glutathione-agarose beads. The 
interactive cellular or extracellular binding partner 

15 can then be added in the presence or absence of the 

test compound in a manner that allows interaction and 
binding to occur. At the end of the reaction period, 
unbound material can be washed away, and the labeled 
monoclonal antibody can be added to the system and 

20 allowed to bind to the complexed components. The 

interaction between the target gene protein and the 
interactive cellular or extracellular binding partner 
can be detected by measuring the amount of 
radioactivity that remains associated with the 

25 glutathione -agarose beads. A successful inhibition of 
the interaction by the test compound will result in a 
decrease in measured radioactivity. 

Alternatively, the GST-target gene fusion protein 
and the interactive cellular or extracellular binding 

30 partner can be mixed together in liquid in the absence 
of the solid glutathione-agarose beads. The test 
compound can be added either during or after the 
species are allowed to interact. This mixture can 
then be added to the glutathione-agarose beads and 

3 5 unbound material is washed away. Again the extent of 
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inhibition of the target gene product/binding partner 
interaction can be detected by adding the labeled 
antibody and measuring the radioactivity associated 
with the beads . 

In another embodiment of the invention, these 
same techniques can be employed using peptide 
fragments that correspond to the binding domains of 
the target gene protein and/or the interactive 
cellular or extracellular binding partner (in cases 
where the binding partner is a protein) , in place of 
one or both of the full length proteins. Any number 
of methods routinely practiced in "the art can be used 
to identify and isolate the binding sites. These 
methods include, but are not limited to, mutagenesis 
of the gene encoding one of the proteins and screening 
for disruption of binding in a co-immunoprecipitation 
assay. Compensating mutations in the gene encoding 
the second species in the complex can then be 
selected. Sequence analysis of the genes encoding the 
respective proteins will reveal the mutations that 
correspond to the region of the protein involved in 
interactive binding. Alternatively, one protein can 
be anchored to a solid surface using methods described 
in this Section above, and allowed to interact with 
and bind to its labeled binding partner, which has 
been treated with a proteolytic enzyme, such as 
trypsin. After washing, a short, labeled peptide 
comprising the binding domain may remain associated 
with the solid material, which can be isolated and 
identified by amino acid sequencing. Also, once the 
gene coding for the for the cellular or extracellular 
binding partner is obtained, short gene segments can 
be engineered to express peptide fragments of the 
protein, which can then be tested for binding activity 
and purified or synthesized. 
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For example, and not by way of limitation, a 
target gene product can be anchored to a solid 
material as described, above, in this Section by 
making a GST-target gene fusion protein and allowing 
5 it to bind to glutathione agarose beads . The 

interactive cellular or extracellular binding partner 
can be labeled with a radioactive isotope, such as 35 S, 
and cleaved with a proteolytic enzyme such as trypsin. 
Cleavage products can then be added to the anchored 

10 GST-target gene fusion protein and allowed to bind. 
After washing away unbound peptides, labeled bound 
material, representing the cellular or extracellular 
binding partner binding domain, can be eluted, 
purified, and analyzed for amino acid sequence by 

15 well-known methods. Peptides so identified can be 
produced synthetically or fused to appropriate 
facilitative proteins using recombinant DNA 
technology . 

20 5.3.4. ASSAYS FOR AMELIORATION OF BODY WEIGHT 

DISORDER SYMPTOMS __ 

Any of the binding compounds, including but not 

limited to, compounds such as those identified in the 

foregoing assay systems, may be tested for the ability 

25 to ameliorate body weight disorder symptoms, which may 
include, for example, obesity, anorexia, and/or an 
abnormal level of food intake. Cell -based and animal 
model -based assays for the identification of compounds 
exhibiting such an ability to ameliorate body weight 

30 disorder symptoms are described below. 

First, cell -based systems such as those 
described, above, in Section 5.2.4.2, may be used to 
identify compounds which may act to ameliorate body 
weight disorder symptoms. For example, such cell 

35 systems may be exposed to a compound, suspected of 
exhibiting an ability to ameliorate body weight 
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substantially normal levels of target gene activity 
are maintained, therefore, nucleic acid molecules that 
encode and express target gene polypeptides exhibiting 
normal target gene activity may, be introduced into 
cells via gene therapy methods such as those 
described, below, in Section 5.5. that do not contain 
sequences susceptible to whatever antisense, ribozyme, 
or triple helix treatments are being utilized. 
Alternatively, in instances whereby the target gene 
encodes an extracellular protein, it may be preferable 
to coadminister normal target gene protein in order to 
maintain the requisite level of target gene activity. 

Anti-sense RNA and DNA, ribozyme, and triple 
helix molecules of the invention may be prepared by 
any method known in the art for the synthesis of DNA 
and RNA molecules . These include techniques for 
chemically synthesizing oligodeoxyribonucleo tides and 
oligoribonucleotides well known in the art such as for 
example solid phase phosphoramidite chemical 
synthesis. Alternatively, RNA molecules may be 
generated by in vitro and in vivo transcription of DNA 
sequences encoding the antisense RNA molecule. Such 
DNA sequences may be incorporated into a wide variety 
of vectors which incorporate suitable RNA polymerase 
promoters such as the T7 or SP6 polymerase promoters . 
Alternatively, antisense cDNA constructs that 
synthesize antisense RNA constitut ively or inducibly, 
depending on the promoter used, can be introduced 
stably into cell lines. 

Various well-known modifications to the DNA 
molecules may be introduced as a means of increasing 
intracellular stability and half -life. Possible 
modifications include but are not limited to, the 
addition of flanking sequences of ribo- or deoxy- 
nucleotides to the 5' and/or 3' ends of the molecule 
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or the use of phosphorothioate or 2' 0-methyl rather 
than phosphodiesterase linkages within the 
oligodeoxyribonucleotide backbone . 

5 5.4.1.2. ANTIBODIES FOR INHIBITION OF 

TARGET GENE PROPHETS 

Antibodies that are both specific for target gene 

protein and interfere with its activity may be used to 

inhibit target gene function. Where desirable, 

10 antibodies specific for mutant target protein which 
interfere with the activity of such mutant target 
product may also be used to inhibit target gene 
function. Such antibodies may be generated using 
standard techniques described in Section 5.2.3., 

15 supra , against the proteins themselves or against 

peptides corresponding to portions of the proteins. 
The antibodies include but are not limited to 
polyclonal, monoclonal, Fab fragments, single chain 
antibodies, chimeric antibodies, etc. 

20 In instances where the target gene protein is 

intracellular and whole antibodies are used, 
internalizing antibodies may be preferred. However, 
lipofectin or liposomes may be used to deliver the 
antibody or a fragment of the Fab region which binds 

25 to the target gene product epitope into cells. Where 
fragments of the antibody are used, the smallest 
inhibitory fragment which binds to the target 
protein's binding domain is preferred. For example, 
peptides having an amino acid sequence corresponding 

30 to the domain of the variable region of the antibody 
that binds to the target gene protein may be used. 
Such peptides may be synthesized chemically or 
produced via recombinant DNA technology using methods 
well known in the art ( e.g. , see Creighton, 1983, 

35 supra ; and Sambrook et al . , 1989, supra ) . 
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Alternatively, single chain neutralizing 
antibodies which bind to intracellular target gene 
product epitopes may also be administered. Such 
single chain antibodies may be administered, for 
5 example, by expressing nucleotide sequences encoding 
single -chain antibodies within the target cell 
population by utilizing, for example, techniques such 
as those described in Marasco et al . (Marasco, W. et 
al., 1993, Proc. Natl. Acad. Sci . USA 90.: 7889-7893 ) . 

10 In instances where the target gene protein is 

extracellular, or is a transmembrane protein, any of 
the administration techniques described, below in 
Section 5.6 which are appropriate for peptide 
administration may be utilized to effectively 

15 administer inhibitory target gene antibodies to their 
site of action. 

5.4.2. METHODS FOR RESTORING TARGET GENE ACTIVITY 
Target genes that cause body weight disorders may 

20 be underexpressed within body weight disorder 

situations. Alternatively, the activity of target 
gene products may be diminished, leading to the 
development of body weight disorder symptoms. 
Described in this Section are methods whereby the 

25 level of target gene activity may be increased to 
levels wherein body weight disorder symptoms are 
ameliorated. The level of gene activity may be 
increased, for example, by either increasing the level 
of target gene product present or by increasing the 

30 level of active target gene product which is present. 

For example, a target gene protein, at a level 
sufficient to ameliorate body weight disorder symptoms 
may be administered to a patient exhibiting such 
symptoms. Any of the techniques discussed, below, in 

35 Section 5.6, may be utilized for such administration. 
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One of skill in the art will readily know how to 
determine the concentration of effective, non-toxic 
doses of the normal target gene protein, utilizing 
techniques such as those described, below, in Section 
5 5.6.1. 

Further, patients may be treated by gene 
replacement therapy. One or more copies of a normal 
target gene or a portion of the gene that directs the 
production of a normal target gene protein with target 

10 gene function, may be inserted into cells, using 
vectors which include, but are not limited to 
adenovirus, adeno-associated virus, and retrovirus 
vectors, in addition to other particles that introduce 
DNA into cells, such as liposomes. Additionally, 

15 techniques such as those described above may be 

utilized for the introduction of normal target gene 
sequences into human cells. 

Cells, preferably, autologous cells, containing 
normal target gene expressing gene sequences may then 

20 be introduced or reintroduced into the patient at 
positions which allow for the amelioration of body 
weight disorder symptoms. Such cell replacement 
techniques may be preferred, for example, whon the 
target gene product is a secreted, extracellular aene 

2 5 product. 

Additionally, antibodies may a^iniPUTfi 
which specifically bind to a target protein and. by 
binding, serve to, either directly or indirectly, 
activate the target protein function. Such antibodies 

30 can include, but are not limited to polyclonal, 

monoclonal, FAb fragments, single chain antibodies, 
chimeric antibodies and the like. The antibodies may 
be generated using standard techniques such as those 
described, above, in Section 5.2.3., and may be 

35 generated against the protein themselves or against 
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proteins corresponding to portions of the proteins. 
The antibodies may be administered, for example, 
according to the techniques described, above, in 
Section 5.4.1.2. 

5 

5.6. PHARMACEUTICAL PREPARATIONS 

AND METHODS OF ADMINISTRATION 

The identified compounds, nucleic acid molecules 

and cells that affect target gene expression, 

synthesis and/or activity can be administered to a 

patient at therapeutically effective doses to treat or 

ameliorate body weight disorders. A therapeutically 

effective dose refers to that amount of the compound 

sufficient to result in amelioration of symptoms of 

body weight disorder, or alternatively, to that amount 

of a nucleic acid molecule sufficient to express a 

concentration of gene product which results in the 

amelioration of such symptoms. 



5.6.1. EFFECTIVE DOSE 
Toxicity and therapeutic efficacy of such 
compounds can be determined by standard pharmaceutical 
procedures in cell cultures or experimental animals, 
e.g. , for determining the LD S0 (the dose lethal to 50% 
of the population) and the ED 50 (the dose 
therapeutically effective in 50% of the population) . 
The dose ratio between toxic and therapeutic effects 
is the therapeutic index and it can be expressed as 
the ratio LD 50 /ED 50 . Compounds which exhibit large 
therapeutic indices are preferred. While compounds 
that exhibit toxic side effects may be used, care 
should be taken to design a delivery system that 
targets such compounds to the site of affected tissue 
in order to minimize potential damage to uninfected 
cells and, thereby, reduce side effects. 
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The data obtained from the cell culture assays 
and animal studies can be used in formulating a range 
of dosage for use in humans . The dosage of such 
compounds lies preferably within a range of 
5 circulating concentrations that include the ED 50 with 
little or no toxicity. The dosage may vary within 
this range depending upon the dosage form employed and 
the route of administration utilized. For any 
compound used in the method of the invention, the 

10 therapeutically effective dose can be estimated 

initially from cell culture assays. A dose may be 
formulated in animal models to achieve a circulating 
plasma concentration range that includes the IC 50 
(i : e . , the concentration of the test compound which 

15 achieves a half -maximal inhibition of symptoms) as 

determined in cell culture. Such information can be 
used to more accurately determine useful doses in 
humans. Levels in plasma may be measured, for 
example, by high performance liquid chromatography. 

20 

5.6.2. FORMULATIONS AND USE 
Pharmaceutical compositions for use in accordance 
with the present invention may be formulated in 
conventional manner using one or more physiologically 
25 acceptable carriers or excipients. 

Thus, the compounds and their physiologically 
acceptable salts and solvates may be formulated for 
administration by inhalation or insufflation (either 
through the mouth or the nose) or oral, buccal, 
30 parenteral or rectal administration. 

For oral administration, the pharmaceutical 
compositions may take the form of, for example, 
tablets or capsules prepared by conventional means 
with pharmaceutically acceptable excipients such as 
35 binding agents ( e.g. , pregelat inised maize starch, 
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polyvinylpyrrolidone or hydroxypropyl 
methylcellulose) ; fillers ( e.g. . lactose, 
microcrystalline cellulose or calcium hydrogen 
phosphate) ; lubricants ( e.g. , magnesium stearate, talc 
5 or silica) ; disintegrants ( e.g. , potato starch or 
sodium starch glycollate) ; or wetting agents ( e.g. , 
sodium lauryl sulphate) . The tablets may be coated by 
methods well known in the art. Liquid preparations 
for oral administration may take the form of, for 
10 example, solutions, syrups or suspensions, or they may 
be presented as a dry product for constitution with 
water or other suitable vehicle before use . Such 
liquid preparations may be prepared by conventional 
means with pharmaceut ically acceptable additives such 
15 as suspending agents ( e.g. , sorbitol syrup, cellulose 
derivatives or hydrogenated edible fats) ; emulsifying 
agents ( e.g. , lecithin or acacia) ; non-aqueous 
vehicles ( e.g. , almond oil, oily esters, ethyl alcohol 
or fractionated vegetable oils) ; and preservatives 
2 0 ( e.g. , methyl or propyl -p-hydroxybenzoates or sorbic 
acid) . The preparations may also contain buffer 
salts, flavoring, coloring and sweetening agents as 
appropriate . 

Preparations for oral administration may be 
25 suitably formulated to give controlled release of the 
active compound. 

For buccal administration the compositions may 
take the form of tablets or lozenges formulated in 
conventional manner. 
30 For administration by inhalation, the compounds 

for use according to the present invention are 
conveniently delivered in the form of an aerosol spray 
presentation from pressurized packs or a nebuliser, 
with the use of a suitable propellant, e.g. , 
35 dichlorodif luoromethane , trichlorof luoromethane , 
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dichlorotetraf luoroethane, carbon dioxide or other 
suitable gas. In the case of a pressurized aerosol 
the dosage unit may be determined by providing a valve 
to deliver a metered amount. Capsules and cartridges 
5 of e.g. gelatin for use in an inhaler or insufflator 
may be formulated containing a powder mix of the 
compound and a suitable powder base such as lactose or 
starch . 

The compounds may be formulated for parenteral 
10 administration by injection, e.g. , by bolus injection 
or continuous infusion. Formulations for injection 
may be presented in unit dosage form, e.g. . in 
ampoules or in mult i -dose containers, with an added 
preservative. The compositions may take such forms as 
15 suspensions, solutions or emulsions in oily or aqueous 
vehicles, and may contain formulatory agents such as 
suspending, stabilizing and/or dispersing agents. 
Alternatively, the active ingredient may be in powder 
form for constitution with a suitable vehicle, e.g. . 
20 sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal 
compositions such as suppositories or retention 
enemas, e.g. . containing conventional suppository 
bases such as cocoa butter or other glycerides . 
25 In addition to the formulations described 

previously, the compounds may also be formulated as a 
depot preparation. Such long acting formulations may 
be administered by implantation (for example 
subcutaneously or intramuscularly) or by intramuscular 
injection. Thus, for example, the compounds may be 
formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable 
oil) or ion exchange resins, or as sparingly soluble 
derivatives, for example, as a sparingly soluble salt. 

35 



30 
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The compositions may, if desired, be presented in 
a pack or dispenser device which may contain one or 
more unit dosage forms containing the active 
ingredient . The pack may for example comprise metal 
5 or plastic foil, such as a blister pack. The pack or 
dispenser device may be accompanied by instructions 
for administration. 

5.7. DIAGNOSIS OF BODY WEIGHT DISORDER ABNORMALITIES 

10 A variety of methods may be employed for the 

diagnosis of body weight disorders, predisposition to 
body weight disorders, for monitoring the efficacy of 
antibody weight disorder compounds during, for 
example, clinical trials and for monitoring patients 

15 undergoing clinical evaluation for the treatment of 
such body weight disorders. 

Such methods may, for example, utilize reagents 
such as the fingerprint gene nucleotide sequences 
described in Sections 5.1, and antibodies directed 

20 against differentially expressed and pathway gene 
peptides, as described, above, in Sections 5.1.3 
(peptides) and 5.2.3 (antibodies). Specifically, such 
reagents may be used, for example, for: (1) the 
detection of the presence of target gene mutations, or 

25 the detection of either over- or under-expression of 
target gene mRNA relative to the non-body weight 
disorder state; and (2) the detection of either an 
over- or an under -abundance of target gene product 
relative to the non-body weight disorder state. 
3 0 The methods described herein may be performed, 

for example, by utilizing pre-packaged diagnostic kits 
comprising at least one specific fingerprint gene 
nucleic acid or anti-fingerprint gene antibody reagent 
described herein, which may be conveniently used, 
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e.g. t in clinical settings, to diagnose patients 
exhibiting body weight disorder abnormalities. 

Any cell type or tissue in which the fingerprint 
gene is expressed may be utilized in the diagnostics 
5 described below. 

Among the methods which can be utilized herein 
are methods for monitoring the efficacy of compounds 
in clinical trails for the treatment of body weight 
disorders. Such compounds can, for example, be 
10 compounds such as those described, above, in Section 
5.4. Such a method comprises detecting, in a patient 
sample, a gene transcript or gene product which is 
differentially expressed in a body weight disorder 
state relative to its expression in a normal, or non- 
15 body weight disorder, state. 

Any of the nucleic acid detection techniques 
described, below, in Section 5.7.1 or any of the 
peptide detection described, below, in Section 5.7.2 
can be used to detect the gene transcript or gene 
20 product which is differentially expressed in a body 
weight disorder relative to its expression in the 
normal, or non- immune disorder, state. 

During clinical trials, for example, the 
expression of a single fingerprint gene, or 
25 alternatively, a fingerprint pattern of a cell 

involved in a body weight disorder can be determined 
in the presence or absence of the compound being 
tested. The efficacy of the compound can be followed 
by comparing the expression data obtained to the 
30 corresponding known expression patterns in a normal, 
non-body weight disorder state. Compounds exhibiting 
efficacy are those which alter the single fingerprint 
gene expression and/or the fingerprint pattern to more 
closely resemble that of the normal, non-body weight 
35 disorder state. 
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The detection of the product or products of genes 
differentially expressed in a body weight disorder 
state relative to their expression in a normal, or 
non-body weight disorder, state can also be used for 
5 monitoring the efficacy of potential anti-body weight 
disorder compounds during clinical trials. During 
clinical trials, for example, the level and/or 
activity of the products of one or more such 
differentially expressed genes can be determined in 
relevant cells and/or tissues in the presence or 
absence of the compound being tested. The efficacy of 
the compound can be followed by comparing the protein 
level and/or activity data obtained to the 
corresponding known levels/activities for the cells 
and/or tissues in a normal, non-body weight disorder 
state. Compounds exhibiting efficacy are those which 
alter the pattern of the cell and/or tissue involved 
in the body weight disorder to more closely resemble 
that of the normal, non-body weight disorder state. 

5.7.1. DETECTION OF FINGERPRINT GENE 
NUCLEIC ACIDS 

DNA or RNA from the cell type or tissue to be 

analyzed may easily be isolated using procedures which 

are well known to those in the art. Diagnostic 

procedures may also be performed "in situ" directly 

upon tissue sections (fixed and/or frozen) of patient 

tissue obtained from biopsies or resections, such that 

no nucleic acid purification is necessary. Nucleic 

acid reagents such as those described in Section 5.1 

may be used as probes and/or primers for such in situ 

procedures (see, for example, Nuovo, G.J., 1992, "PCR 

In Situ Hybridization: Protocols And Applications", 

Raven Press, NY) . 

Fingerprint gene nucleotide sequences, either RNA 

or DNA, may, for example, be used in hybridization or 
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amplification assays of biological samples to detect 
body weight disorder-related gene structures and 
expression. Such assays may include, but are not 
limited to, Southern or Northern analyses, single 
5 stranded conformational polymorphism analyses, in situ 
hybridization assays, and polymerase chain reaction 
analyses. Such analyses may reveal both quantitative 
aspects of the expression pattern of the fingerprint 
gene, and qualitative aspects of the fingerprint gene 

10 expression and/or gene composition. That is, such 

techniques may include, for example, point mutations, 
insertions, deletions, chromosomal rearrangements, 
and/or activation or inactivation of gene expression. 
Diagnostic methods for the detection of 

15 fingerprint gene-specific nucleic acid molecules may 

involve for example, contacting and incubating nucleic 
acids, derived from the cell type or tissue being 
analyzed, with one or more labeled nucleic acid 
reagents as are described in Section 5.1, under 

20 conditions favorable for the specific annealing of 

these reagents to their complementary sequences within 
the nucleic acid molecule of interest. Preferably, 
the lengths of these nucleic acid reagents are at 
least 15 to 30 nucleotides. After incubation, all 

25 non-annealed nucleic acids are removed from the 
nucleic acid : fingerprint molecule hybrid. The 
presence of nucleic acids from the cell type or tissue 
which have hybridized, if any such molecules exist, is 
then detected. Using such a detection scheme, the 

30 nucleic acid from the cell type or tissue of interest 
may be immobilized, for example, to a solid support 
such as a membrane, or a plastic surface such as that 
on a microtiter plate or polystyrene beads . In this 
case, after incubation, non-annealed, labeled nucleic 

35 acid reagents of the type described in Section 5.1 are 
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easily removed. Detection of the remaining, annealed, 
labeled fingerprint nucleic acid reagents is 
accomplished using standard techniques well-known to 
those in the art. 

Alternative diagnostic methods for the detection 
of fingerprint gene specific nucleic acid molecules 
may involve their amplification, e.g., by PCR (the 
experimental embodiment set forth in Mullis, K.B., 
1987, U.S. Patent No. 4,683,202), ligase chain 
reaction (Barany, F., 1991, Proc . Natl. Acad. Sci. USA 
8t8: 189-193) , self sustained sequence replication 
(Guatelli, J.C. et al . . 1990, Proc. Natl. Acad. Sci. 
USA 87.: 1874-1878) , transcriptional amplification 
system (Kwoh, D.Y et al . , 1989, Proc. Natl. Acad. Sci. 
USA .86: 1173-1177) , Q-Beta Replicase (Lizardi, P.M. et 
al . , 1988, Bio/Technology ,6:1197), or any other 
nucleic acid amplification method, followed by the 
detection of the amplified molecules using techniques 
well known to those of skill in the art. These 
detection schemes are especially useful for the 
detection of nucleic acid molecules if such molecules 
are present in very low numbers. 

In one embodiment of such a detection scheme, a 
cDNA molecule is obtained from an RNA molecule of 
interest ( e.g. , by reverse transcription of the RNA 
molecule into cDNA) . Cell types or tissues from which 
such RNA may be isolated include any tissue in which 
wild type fingerprint gene is known to be expressed. 
A sequence within the cDNA is then used as the 
template for a nucleic acid amplification reaction, 
such as a PCR amplification reaction, or the like. 
The nucleic acid reagents used as synthesis initiation 
reagents ( e.g. , primers) in the reverse transcription 
and nucleic acid amplification steps of this method 
are chosen from among the fingerprint gene nucleic 
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acid reagents described in Section 5.1. The preferred 
lengths of such nucleic acid reagents are at least 9- 
30 nucleotides. For detection of the amplified 
product, the nucleic acid amplification may be 
5 performed using radioactively or non-radioactively 

labeled nucleotides. Alternatively, enough amplified 
product may be made such that the product may be 
visualized by standard ethidium bromide staining or by 
utilizing any other suitable nucleic acid staining 
10 method. 

In addition to methods which focus primarily on 
the detection of one fingerprint nucleic acid 
sequence, fingerprint patterns or profiles may also be 
assessed in such detection schemes. "Fingerprint 
15 pattern" or "fingerprint profile" , as used herein, 

refers to the pattern of mRNA expression obtained for 
a given tissue or cell type under a given set of 
conditions, and includes the mRNA expression of at 
least two of the genes within the tissue or cell type. 
20 Such conditions may include, but are not limited to 
body weight disorders, including obesity, and 
conditions relevant to processes involved : r. bo-r 
weight or appetite regulation, including ar.y c! 
control or experimental conditions der:r;t^j 
25 paradigms of Section 5.1.1.1, above. * i r..T? r; : : 

profiles may be generated, for example, try u ::.::;r.g a 
differential display procedure, as discussed, aDove, 
in Section 5.1.1.2, Northern analysis and/or RT-PCR. 
Any of the gene sequences described, above, in Section 
5.2.1 may be used as probes and/or PCR primers for the 
generation and corroboration of such fingerprint 
profiles . 



30 



35 
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5.7.2. DETECTION OF TARGET GENE PEPTIDES 
Antibodies directed against wild type or mutant 
fingerprint gene peptides, which are discussed, above, 
in Section 5.2.3, may also be used as body weight 
5 disorder diagnostics and prognostics, as described, 

for example, herein. Such diagnostic methods, may be 
used to detect abnormalities in the level of 
fingerprint gene protein expression, or abnormalities 
in the structure and/or temporal, tissue, cellular, or 
10 subcellular location of fingerprint gene protein. 
Structural differences may include, for example, 
differences in the size, electronegativity, or 
antigenicity of the mutant fingerprint gene protein 
relative to the normal fingerprint gene protein. 
15 Protein from the tissue or cell type to be 

analyzed may easily be isolated using techniques which 
are well known to those of skill in the art. The 
protein isolation methods employed herein may, for 
example, be such as those described in Harlow and Lane 
20 (Harlow, E . and Lane ? D., 1988, "Antibodies: A 

Laboratory Manual", Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York) , which is 
incorporated herein by reference in its entirety. 

Preferred diagnostic methods for the detection of 
25 wild type or mutant fingerprint gene peptide molecules 
may involve, for example, immunoassays wherein 
fingerprint gene peptides are detected by their 
interaction with an anti- fingerprint gene product- 
specific antibody . 
3 0 For example, antibodies, or fragments of 

antibodies, such as those described, above, in Section 
5.2.3, useful in the present invention may be used to 
quantitatively or qualitatively detect the presence of 
wild type or mutant fingerprint ^ene peptides. This 
3 5 can be accomplished, for example, by 
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immunofluorescence techniques employing a 
f luorescently labeled antibody (see below, this 
Section) coupled with light microscopic, flow 
cytometric, or fluorimetric detection. Such 
5 techniques are especially preferred if the fingerprint 
gene peptides are expressed on the cell surface. 

The antibodies (or fragments thereof) useful in 
the present invention may, additionally, be employed 
histologically, as in immunofluorescence or 

10 immunoelectron microscopy, for in situ detection of 
fingerprint gene peptides. In situ detection may be 
accomplished by removing a histological specimen from 
a patient, and applying thereto a labeled antibody of 
the present invention. The antibody (or fragment) is 

15 preferably applied by overlaying the labeled antibody 
(or fragment) onto a biological sample. Through the 
use of such a procedure, it is possible to determine 
not only the presence of the fingerprint gene 
peptides, but also their distribution in the examined 

20 tissue. Using the present invention, those of 

ordinary skill will readily perceive that any of a 
wide variety of histological methods (such as staining 
procedures) can be modified in order to achieve such 
in situ detection. 

25 Immunoassays' for wild type or mutant fingerprint 

gene peptides typically comprise incubating a 
biological sample, such as a biological fluid, a 
tissue extract, freshly harvested cells, or cells 
which have been incubated in tissue culture, in the 

30 presence of a detectably labeled antibody capable of 
identifying fingerprint gene peptides, and detecting 
the bound antibody by any of a number of techniques 
well-known in the art. 

The biological sample may be brought in contact 

35 with and immobilized onto a solid phase support or 
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carrier such as nitrocellulose, or other solid support 
which is capable of immobilizing cells, cell particles 
or soluble proteins. The support may then be washed 
with suitable buffers followed by treatment with the 
5 detectably labeled fingerprint gene specific antibody. 
The solid phase support may then be washed with the 
buffer a second time to remove unbound antibody. The 
amount of bound label on solid support may then be 
detected by conventional means. 

10 By "solid phase support or carrier" is intended 

any support capable of binding an antigen or an 
antibody. Well-known supports or carriers include 
glass, polystyrene, polypropylene, polyethylene, 
dextran, nylon, amylases, natural and modified 

15 celluloses, polyacrylamides , gabbros, and magnetite. 
The nature of the carrier can be either soluble to 
some extent or insoluble for the purposes of the 
present invention. The support material may have 
virtually any possible structural configuration so 

20 long as the coupled molecule is capable of binding to 
an antigen or antibody. Thus, the support 
configuration may be spherical, as in a bead, or 
cylindrical, as in the inside surface cf a test tube, 
or the external surface of a rod. Alterna: : vcly, the 

25 surface may be flat such as a sheet, ter. : ctrif . etc. 
Preferred supports include polystyrer.v Lt*aai Those 
skilled in the art will know many other suitable 
carriers for binding antibody or antigen, or will be 
able to ascertain the same by use of routine 

3 0 experimentation . 

The binding activity of a given lot of anti-wild 
type or ant i -mutant fingerprint gene product antibody 
may be determined according to well known methods. 
Those skilled in the art will be able to determine 

35 
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operative and optimal assay conditions for each 
determination by employing routine experimentation. 

One of the ways in which the fingerprint gene 
peptide-specif ic antibody can be detectably labeled is 
5 by linking the same to an enzyme and use in an enzyme 
immunoassay (EIA) (Voller, A. , "The Enzyme Linked 
Immunosorbent Assay (ELISA) " , 1978, Diagnostic 
Horizons 2:1-7, Microbiological Associates Quarterly 
Publication, Walkersville , MD) ; Voller, A. et al . , 

10 1978, J. Clin. Pathol. 31:507-520; Butler, J.E., 1981, 
Meth. Enzymol. 73:482-523; Maggio, E. (ed. ) , 1980, 
ENZYME IMMUNOASSAY, CRC Press, Boca Raton, FL, ; 
Ishikawa, E. et al . , (eds.), 1981, ENZYME IMMUNOASSAY, 
Kgaku Shoin, Tokyo) . The enzyme which is bound to the 

15 antibody will react with an appropriate substrate, 

preferably a chromogenic substrate, in such a manner 
as to produce a chemical moiety which can be detected, 
for example, by spectrophotometric , fluorimetric or by 
visual means. Enzymes which can be used to detectably 

20 label the antibody include, but are not limited to, 

malate dehydrogenase, staphylococcal nuclease, delta- 
s' steroid isomerase, yeast alcohol dehydrogenase, 
alpha-glycerophosphate , dehydrogenase , triose 
phosphate isomerase, horseradish peroxidase, alkaline 

25 phosphatase, asparaginase, glucose oxidase, beta- 
galactosidase, ribonuclease , urease, catalase, 
glucose -6 -phosphate dehydrogenase, glucoamylase and 
acetylcholinesterase. The detection can be 
accomplished by colorimetric methods which employ a 

3 0 chromogenic substrate for the enzyme. Detection may 
also be accomplished by visual comparison of the 
extent of enzymatic reaction of a substrate in 
comparison with similarly prepared standards. 

Detection may also be accomplished using any of a 

3 5 variety of other immunoassays. For example, by 
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radioactively labeling the antibodies or antibody 
fragments, it is possible to detect fingerprint gene 
wild type or mutant peptides through the use of a 
radioimmunoassay (RIA) (see, for example, Weintraub, 
5 B., Principles of Radioimmunoassays, Seventh Training 
Course on Radioligand Assay Techniques, The Endocrine 
Society, March, 1986, which is incorporated by 
reference herein) . The radioactive isotope can be 
detected by such means as the use of a gamma counter 

10 or a scintillation counter or by autoradiography. 

It is also possible to label the antibody with a 
fluorescent compound. When the f luorescently labeled 
antibody is exposed to light of the proper wave 
length, its presence can then be detected due to 

15 fluorescence. Among the most commonly used 

fluorescent labeling compounds are fluorescein 
isothiocyanate , rhodamine , phycoerythrin, phycocyanin, 
allophycocyanin, o-phthaldehyde and f luorescamine . 

The antibody can also be detectably labeled using 

20 fluorescence emitting metals such as 1S2 Eu, or others 
of the lanthanide series. These metals can be 
attached to the antibody using such metal chelating 
groups as diethylenetriaminepentacetic acid (DTPA) or 
ethylenediaminetetraacetic acid (EDTA) . 

2 5 The antibody also can be detectably labeled by 

coupling it to a chemiluminescent compound. The 
presence of the chemiluminescent -tagged antibody is 
then determined by detecting the presence of 
luminescence that arises during the course of a 
30 chemical reaction. Examples of particularly useful 
chemiluminescent labeling compounds are luminol , 
isoluminol, theromatic acridinium ester, imidazole, 
acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used 

3 5 to label the antibody of the present invention. 
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Bioluminescence is a type of chemiluminescence found 
in biological systems in, which a catalytic protein 
increases the efficiency of the chemiluminescent 
reaction. The* presence of a bioluminescent protein is 
5 determined by detecting the presence of luminescence. 
Important bioluminescent compounds for purposes of 
labeling are luciferin, lucif erase and aequorin. 

6. EXAMPLE: IDENTIFICATION AND 

n CHARACTER I ZAT I ON OF AN 

° OBESITY-RELATED GENE 

In the Example presented in this Section, one of 

the paradigms described, above, in Section 5.1.1.1, 

the genetic obesity paradigm, is utilized to identify 

a gene which is not only differentially expressed in 

5 genetically obese test animals, but is identical to a 

gene which has previously been implicated in processes 

involved in body weight regulation. Thus, the 

successful identification, here, of this gene 

corroborates the usefulness of the paradigm approach 

1 0 

of the invention for the identification of genes 
involved in body weight disorders and/or in body 
weight or appetite regulation. 



6.1. MATERIALS AND METHODS 

2 5 

Genetic obesity paradigms : 15 female C57B1/6J 

ob/ob mice and lean litter-mate controls (15 female 
C57B1/6J ?/+) and 15 male C57B1/KS db/db mice and lean 
littermate controls (15 male C57Bl/ks + / + ) were 
received from Jackson labs at 4.5 weeks of "age, and 
30 housed individually on normal mouse chow (West, D.B., 
1992, Am. J. Physiol. 262. : R1025 -R1032 ) for 1 week 
prior to the initiation of the study. The four groups 
of 15 mice each were then sacrificed by C0 2 euthanasia 
and tissues were collected. Body weight (grams) of 

35 
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the four groups of mice at the time of sacrifice was 
measured . 

Tissue collection and RNA isolation : Following 
5 C0 2 asphyxiation, tissues were removed and quick frozen 
on dry ice . Samples within an experimental or control 
group (15 animals per group) were then homogenized 
together with a mortar and pestle under liquid 
nitrogen . 

10 Total cellular RNA was extracted from tissue with 

either RNAzol™ or RNAzolB™ (Tel -Test, Friendswood, 
TX) , according to the manufacturer's instructions. 
Briefly, the tissue was solubilized in an appropriate 
amount of RNAzol™ or RNAzolB™ , and RNA was extracted 

15 by the addition of 1/10 v/v chloroform to the 

solubilized sample followed by vigorous shaking for 
approximately 15 seconds. The mixture was then 
centrifuged for 15 minutes at 12,000g and the aqueous 
phase was removed to a fresh tube. RNA was 

20 precipitated with isopropanol . The resultant RNA 

pellet was dissolved in water and re-extracted with an 
equal volume of chloroform to remove any remaining 
phenol. The extracted volume was precipitated with 2 
volumes of ethanol in the presence of 150mM sodium 

25 acetate. The precipitated RNA was dissolved in water 
and the concentration determined spectroscopically 

(^26o) - 

Differential display : Total cellular RNA (10- 

30 50/xg) was treated with '20 Units DNase I (Boehringer 
Mannheim, Germany) in the presence of 40 Units 
ribonuclease inhibitor (Boehringer Mannheim, Germany) . 
After extraction with phenol /chloroform and ethanol 
precipitation, the RNA was dissolved in DEPC (diethyl 
35 pyrocarbonate) -treated water. 
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Differential mRNA display was carried out as 
described, above, in Section 5.1.1.2. RNA (0.4-2/xg) 
was reverse- transcribed using Superscript reverse 
transcriptase (GIBCO/BRL) . The cDNAs were then 
5 amplified by PCR on a Perkin-Elmer 9600 thermal 
cycler. The reaction mixtures (2 0/zl) included 
arbitrary decanucleotides and one of twelve possible 
T 12 VN sequences, wherein V represents either dG, dC, or 
dA, and N represents either dG, dT, dA, or dC. 

10 Parameters for the 40 cycle PCR were as follows: Hold 
94° C. 2 minutes; Cycle (40 rounds) 94° C. 15 seconds, 
40° C. 2 minutes; Ramp to 72° 30 seconds; Hold 72° C. 5 
minutes; Hold 4° C. 

Radiolabelled PCR amplification products were 

15 analyzed by electrophoresis on 6% denaturing 
polyacrylamide gels . 

Reamplif ication and subclonina : PCR bands of 

interest were recovered from sequencing gels and 

20 reamplif ied. 

Briefly, autoradiograms were aligned with the 
dried gel, and the region containing the bands of 
interest was excised with a scalpel. The excised gel 
fragment was eluted by soaking in 100/il TE (Tris-EDTA) 

25 buffer at approximately 100° C. for 15 minutes. The 
gel slice was then pelleted by brief cent r if ugat ion 
and the supernatant was transferred to a new 
microcentrifuge tube. DNA was combined with ethanol 
in the presence of lOOmM Sodium acetate and 30/ig 

30 glycogen (Boerhinger Mannhein, Germany) and 

precipitated on dry ice for approximately 10 minutes. 
Samples were centrifuged for 10 minutes and pellets 
were washed with 80% ethanol. Pellets were 
resuspended in 10/il distilled water. 
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5jzl of the eluted DNA were reamplif ied in a lOOptl 
reaction containing: standard Cetus Taq polymerase 
buffer, 20/xM dNTPs, 1/xM of each of the oligonucleotide 
primers used in the initial generation of the 
5 amplified DNA. Cycling conditions used were the same 
as the initial conditions used to generate the 
amplified band, as described above. One-half of the 
amplification reaction was run on a 2% agarose gel and 
eluted using DE-81 paper (Whatman Paper, Ltd., 

10 England) as described in Sambrook et al . , supra . 

Recovered fragments were ligated into the cloning 
vector pCR tro II (Invitrogen, Inc., San Diego CA) and 
transformed into competent E . coli strain DHSor 
(Gibco/BRL, Gaithersburg , MD) . Colonies were grown on 

15 LB-agar plates containing ampicillin (100/xg/ml) and X- 
gal <40Mg/ml) to permit blue/white selection. 

Sequence analysis : After subcloning, reamplified 
cDNA fragments were sequenced on an Applied Biosystems 

20 Automated Sequencer (Applied Biosystems, Inc. Seattle, 
WA) . Sequence was obtained from four independent 
transf ormants containing the same insert. The 
nucleotide sequence shown herein represents the 
consensus of the information obtained from the four 

25 sequences. Such primary sequence data was edited and 
trimmed of vector sequences and highly repetitive 
sequences and used to search Genbank databases using 
the BLAST (Altschul, S.F. et al . , 1990, J. Mol . Biol. 
215.: 4 03-410) program. 

30 

Northern analysis : RNA samples were 
electrophoresed in a denaturing agarose gel containing 
1-1.5% agarose (SeaKem™ LE, FMC BioProducts, Rockland, 
ME) containing 6.3% formaldehyde. Samples containing 
35 5-20/ig of total RNA were mixed with denaturing loading 
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solution (72% deionized formamide and bromophenol 
blue) and heated to 70°C. for 5 minutes. Samples were 
placed on ice and immediately loaded onto gels. Gels 
were run in lx MOPS buffer (lOOmM MOPS, 25mM sodium 
5 acetate, 5mM EDTA) . After electrophoresis, the gels 
were stained with ethidium bromide and visualized with 
ultraviolet light. 

After completion of electrophoresis, gels were 
soaked in 50mM sodium hydroxide with gentle agitation 

10 for approximately 3 0 minutes to lightly cleave RNA. 

Gels were rinsed twice in water and then neutralized 
by soaking in 0 . 1M Tris-HCl (pH 7.5) for approximately 
30 minutes. Gels were briefly equilibrated with 20x 
SSC (3M sodium chloride, 0 . 3M sodium citrate) and then 

15 transferred to nylon membranes such as Hybond™, -N, 

(Amersham, Inc., Arlington Heights, IL) or Zeta-Probe 
(Bio-Rad, Inc., Hercules, CA) overnight in 20x SSC. 
Membranes containing transferred RNA were baked at 
80°C. for 2 hours to immobilize the RNA. 

2 0 DNA fragments to be used as probes were of 

various sizes and were labeled using a random hexamer 
labeling technique. Briefly, 25ng of a purified DNA 
fragment was used to generate each probe. Fragments 
were added to a 20/il random hexanucleot ide labeling 

25 reaction (Boehringer Mannhein, Inc., Indianapolis, IN) 
containing random hexamers and a mix of the 
nucleotides dCTP, dGTP, and dTTP (at a final 
concentration of 25^M each) . The reaction mix was 
heat-denatured at 100°C. for 10 minutes and then 

30 chilled on ice. 5/xl of a- 32 P-dATP (SO^iCi; Amersham, 

Inc., Arlington Heights, IL) and Klenow DNA polymerase 
(2 units; Boehringer Mannheim, Inc., Indianapolis, IN) 
were added. Reactions were incubated at 3 7° for 30 
minutes. Following incubation, 30^x1 water was added 

35 to the labeling reaction and unincorporated 
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nucleotides were removed by passing the reactions 
through a BioSpin-6™ chromatography column (Bio-Rad, 
Inc., Hercules, CA) . Specific incorporation was 
determined using a scintillation counter. 1-5x10* cpm 
5 were used per ml hybridization mixture. 

Nylon membranes containing immobilized RNA were 
prehybridized according to manufacturer's 
instructions. Radiolabelled probes were heat 
denatured at 70°C. in 50% deionized formamide for 10 

10 minutes and ten added to the hybridization mixture 

(containing 50% formamide, 10% dextran sulfate, 0.1% 
SDS, lOOjzg/ml sheared salmon sperm DNA, 5x SSC, 5x 
Denhardt's solution, 30mM Tris-HCl (pH 8.5), 50mM NaPO< 
(pH 6.5). Hybridizations were carried out at 42°C. 

15 overnight. Nylon membranes were then bathed for 2 

minutes in a wash solution of 0 . 2x SSC and 0.1% SDS at 
room temperature to remove most of the remaining 
hybridization solution. The membranes were then 
bathed twice in fresh 42°C. preheated wash solution for 

20 20 minutes. Filters. were covered in plastic wrap and 
exposed to autoradiographic film to visualize results. 

6.2. RESULTS 
Genetic obesity paradigms were utilized to 
25 identify genes which are differentially expressed in 

obese versus lean mice. Specifically, ob/ob and db/db 
obese mice were utilized in conjunction with lean 
littermate control mice, as described, above, in 
Section 6.1. 

30 RNA samples isolated from liver tissue of the 

ob/ob . db/db , and littermate control mice were 
analyzed via differential display techniques. FIG. 1 
shows amplified fragments obtained from these tissues 
when subjected to PCR with 11 separate primer pair 

35 combinations. The arrow in FIG. 1, indicates a PCR 
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product, designated band L36, which was judged to be 
differentially expressed among the lean and obese (ob 
and db) samples, with a larger amount of expression in 
the obese relative to the lean control samples. 
5 To confirm the putative differential gene 

regulation, the amplified L36 band was recovered, 
reamplified, and used to probe Northern RNA blots 
which were prepared with the original liver RNA 
samples. FIG. 2 shows the results of one such 

10 Northern blot analysis, in which the steady messages 
, corresponding to cDNA band L3 6 are shown to be 

significantly increased in RNA samples derived from 
both ob/ob and db/db mice compared to lean littermate 
controls. Thus, this study confirmed the putative 

15 differential regulation which had been suggested by 
the differential display result. 

The reamplified fragment corresponding to band 
L3 6 was subcloned into a cloning vector and sequenced, 
as described, above, in Section 6.1. Plasmid DNA from 

20 four independent transf ormants was sequenced. All 
four plasmids were shown to contain the same insert 
and a consensus sequence of the four sequences was 
compiled and is shown in FIG. 3A. 

A database search with this consensus sequence 

25 resulted in an alignment with greater than 99% 

identity to a mouse stearoyl-Co-A desaturase gene, 
SCD1 (Ntambi, J.M. et al . , 1988, J. Biol. Chem. 
263:17291-17300; Kaestner, K.H. et al . , 1989, J. Biol. 
Chem. 264.: 14755-14761) , which encodes an enzyme that 

30 converts saturated fats to mono-unsaturated fats in 
the liver (FIG. 3B) . 

Mouse stearoyl-Co-A-desaturase mRNA is induced in 
liver upon feeding of fasted animals (Ntambi, J.M. et 
al., supra ; Ntambi, J.M., 1992, J. Biol. Chem. 

35 26.7:10925-10930) . Further, in studies of lean versus 
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obese mice, rats and chickens, stearoyl-Co-A enzymatic 
activity has consistently been reported to be higher 
in fat than lean animals (Esner, M., 1979, Biochem. J. 
180:551*558; Wahle, K.W.J, and Radcliffe, J.D., 1977, 
5 Lipids 12:135-139; Legrand, P. et al . , 1987, Comp. 

Biochem. Physiol. 87B: 789-792 ) . Additionally, it has 
been shown that stearoyl-Co-A activity is higher in 
chickens than turkeys (Kouba, M. et al . , 1993, Comp. 
Biochem. Physiol. 105A : 359-362) . It is considered 

10 that turkeys are a low fat animal as compared to 

chickens. Thus, it is likely that the stearoyl-Co-A 
enzyme is involved in such body weight regulating 
processes as control, metabolism and storage of 
dietary components . 

15 Therefore, by utilizing the genetic obesity 

paradigms described in this Section and in Section 
5.1.1.1, above, a differentially regulated gene, the 
mouse stearoyl-Co-A gene, involved in body weight 
regulation has been identified, thereby corroborating 

20 the usefulness of such paradigms in identifying genes 
important to body weight disorders, and/or body weight 
or appetite regulation. 



In the Example presented in this Section, the 
short term appetite control paradigm, as described, 
above, in Section 5.1.1.1, is utilized to identify 
30 gene sequences which are differentially expressed and 
which may contribute to body weight disorders and/or 
may be involved in such processes as body weight 
regulation or appetite modulation. 



25 



EXAMPLE : 



IDENTIFICATION OF GENES 
DIFFERENTIALLY EXF RETSEI IN 
RESPONSE TO SHORT 7EPJ* A? PETITE 
CONTROL PARADIGMS 



35 
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7.1. MATERIALS AND METHODS 
Short term appetite control paradigm: 45 male 
C57B1/6J mice 8 weeks of age were received from 
Jackson labs. The animals were randomized into three 
5 groups of 15 mice each, and housed individually on 
normal mouse chow (West, D.B . et al . , 1992, Am. J. 
Physiol. 262 :R1025-R1032) ) for 1 week prior to the 
initiation of the study. Group 1 mice (Control) were 
maintained on ad lib mouse chow up until the time of 

10 sacrifice. Group 2 mice (Fasted) were fasted for 24 
hours prior to sacrifice (water continuously 
available) . Group 3 mice (Fasted-Ref ed) were fasted 
for 24 hours and then offered a highly palatable meal 
(mouse chow mixed with peanut butter) for 1 hour prior 

15 to sacrifice. All mice were weighed immediately 
before the initiation of the experiment and 
immediately afterward. All mice were sacrificed by C0 2 
asphyxiation. 

20 RT-PCR analysis : Quantitative RT-PCR was 

performed as follows. 1-2/xg of total RNA, prepared as 
described, above, in Section 6.1, was reverse 
transcribed with oligo dT (12 _ 18) primers and Superscript™ 
RNAase H-reverse transcriptase (Gibco-BRL, 

25 Gaithersburg, MD) . Briefly, RNA was combined with l/il 
oligo dT (500/ig/ml) in a total volume of lljil. The 
mixture was heated to 70° C. for 10 minutes and chilled 
on ice. After a brief centrif ugation RNA was reverse 
transcribed for 1 hour. Aliquots of the first strand 

30 cDNA were stored at -20° C. until just prior to use. 
Expression levels were determined by PCR 
amplification of serial dilutions of first strand 
cDNA. In this procedure, cDNA is serially diluted in 
water. The dilutions are then batch amplified by PCR 

35 using sequence-specific primers. All PCR reactions 
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are amplified under identical conditions. Therefore, 
the amount of product generated should reflect the 
amount of sequence template which was initially 
present. 5-10 fold dilutions of cDNA were used and 
5 enough dilutions were used such that the amount of 
product subsequently produced ranged from clearly 
visible, by UV illumination of ethidium bromide - 
stained gels, to below detection levels. The method 
described herein can distinguish 10-fold differences 
10 in expression levels. 

Primers were designed for the amplification of 
the sequenced amplified bands, which were chosen using 
the program OLIGO (National Biosciences, Plymouth, 
MN) . All quantitative PCR reactions were carried out 
15 in a 9600 Perkin-Elmer PCR machine (Perkin-Elmer) . 

Generally, amplification conditions were as follows: 
30-40 cycles consisting of a 95° C. denaturation for 30 
seconds, 72° C. extension for 1 minute, 50-60° C. 
annealing for 30 seconds. Following cycling, 
20 reactions were extended for 10 minutes at 72° C. 

Other procedures : All other tissue collection, 
RNA isolation, differential display, sequence 
analysis, and Northern procedures performed in the 
25 experiments described in this Section were as 
described, above, in Section 6.1. 

7.2. RESULTS 
Mice, as described, above, in Section 7.1, were 
30 utilized as part of short term appetite control 

paradigms. Briefly, C57B1/6J mice were divided into 
Control, Fasted, and Fasted-Refed groups, in order to 
identify genes which are differentially expressed in 
response to hunger and satiety. 
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The mice were weighed immediately before the 
initiation of the study and immediately prior to their 
sacrifice at the end of the study. Body weights (in 
grams) were as in Table 3, below: 

TABLE 3 





Control 


Fasted 


Fasted-Ref ed 


Before Study 


23 . 9+/-1 . 3 


23 .3+/-1 . 1 


23 . 2+/-1 . 4 


After Study 


24 .4+/-1.4 


19 .3+/-1 . 1 


21.7+/1-1-4 



10 



15 



20 



25 



30 



35 



Upon sacrifice, Control, Fasted, and Fasted-Ref ed 
tissues were collected and immediately frozen. The 
tissues collected were: hypothalamus, liver, small 
intestine, pancreas, stomach, and omental adipose 
tissue. RNA was collected from the tissue samples 
obtained and was subjected to differential display, as 
described, above, in Section 7.1. 

Utilizing such short term appetite control 
paradigms and differential display techniques, several 
gene sequences were identified. Data obtained from 
such sequences is summarized, below, in Table 4. The 
differential expression data identifying these gene 
sequences as corresponding to genes which may be 
involved in body weight disorders and/or body weight 
or appetite regulation is listed in the columns headed 
"Fasted" and "Ref ed" , depending on the paradigm in 
which differential expression of a given gene was 
analyzed. Further, the tissue in which the 
differential expression was observed is noted, as is 
the difference in expression of each gene in the 
experimental (either fasted or refed animals) versus 
control tissues. " It" indicates that gene expression 
is increased ( i.e. , there is an increase in the amount 
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of detectable mRNA produced by a given gene) in 
experimental versus control tissue, while "U" 
indicates that gene expression is decreased ( i.e. , 
there is an decrease in the amount of detectable mRNA 
5 produced by a given gene) in experimental versus 

control tissue. Table 4 also notes whether the gene 
sequence corresponds to a gene which had previously 
been identified, and additionally notes the figure in 
which the nucleotide sequence of the given sequence is 
10 listed. 



TABLE 4 



15 


Gene 


Fasted 


Refed 


Previously 
Known | 


Sequence 




P3 (SEQ. ID 
NO:4, 39) 


ft Pancreas 




Yes 


FIG. 4 


20 


P13 (SEQ. ID 
NO:7, 40-42) 


t Pancreas 




Yes 


FIG. 5 


F5 (SEQ. ID 
NO: 12) 


t Adipose 




Yes 


FIG. 6 


25 


L7/L21 (SEQ. 
ID NO: 18) 


ft Liver 




Yes 


FIG. 9 


H27 (SEQ. ID 
NO:49) 


fl Hypothalamus 




Yes 


FIG. 20 



In addition to the tissues, listed above in Table 
30 4, in which the initial differential expression was 

observed, further analysis of the tissue distribution 
of gene expression of the differentially expressed 
genes has been conducted. Such an analysis consisted 
of either Northern or RT-PCR studies, or both, as 
35 described, above, in Section 7.1. The tissue 
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distribution information obtained for the above-listed 
genes is reported, above, in Table 1 of Section 5.2.1. 

Database searches revealed that the genes listed 
in Table 4, above, identified via the short term 
5 appetite control paradigms described herein have 

previously been identified . Specifically, P3 (SEQ ID 
NO.: 4) represents the gene encoding mouse glutamine 
synthetase (FIG. 4); P13 (SEQ ID NO.:7, 40-42) 
represents the gene encoding mouse islet regenerating 

10 protein (FIG. 5); F5 (SEQ ID NO.:12) represents the 

gene encoding mouse a-amylase (FIG. 6) ; L7/L21 (SEQ ID 
NO.: 18) represents the gene encoding mouse cytochrome 
c oxidase subunit I (FIG. 9) ; and H27 (SEQ ID NO: 4 9) 
represents the gene encoding mouse autoantigen La 

15 (FIG. 20) . 



genetic obesity paradigms, as described, above, in 
Section 5.1.1.1, were utilized to identify gene 
sequences which are differentially expressed ar.: wr.ich 
may contribute to body weight disorderr ar.d :r rr.oy be 



8. EXAMPLE: IDENTIFICATION OF GENES 



DIFFERENTIALLY EXPRESSED IN 
RESPONSE TO GENETIC OBESITY 
PARADIGMS 



20 



In the Example presented in this Section, 



25 



involved in such processes as body we;^r.: 
or appetite modulation. 



r f* i u 1 a • :cr, 



8.1. MATERIALS AND METHODS 



30 



Genetic obesity paradigms : Animals and 

animal treatments were as described above, in Section 



6.1. 



35 



Other procedures : All other tissue collection, 
RNA isolation, differential display and sequence 
analysis procedures performed in the experiments 
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described in this Section were as described, above, in 
Section 6.1. RT-PCR procedures were as described, 
above, in Section 7.1. 

5 8.2. RESULTS 

Ob/ob , db/db . and lean littermate control mice, 
as described, above, in Section 8.1, were utilized as 
part of genetic obesity paradigms . The mice were 
weighed at the end of the study, immediately prior to 
10 sacrifice . 

Upon sacrifice, tissues were collected from the 
four groups ( i .e . , ob/ob , db/db and lean littermate 
controls) and immediately- frozen. The tissues 
collected were: hypothalamus, liver, small intestine, 
15 pancreas, stomach, epididymal or uterine fat pads, and 
skeletal muscle. RNA was collected from the tissue 
samples obtained and was subjected to differential 
display, as described, above, in Section 8.1. 

Utilizing such genetic obesity paradigms and 
20 differential display techniques, several gene 
sequences, corresponding to both unique (i.e. , 
previously unknown) and known genes were identified. 
Data obtained from such sequences is summarized, 
below, in Table 5. The differential expression data 
25 identifying these gene sequences as corresponding to 
genes which may be involved in body weight disorders 
and/or body weight or appetite regulation is listed in 
the columns headed " Ob/ob and " Db/db " , depending on 
the paradigm in which differential expression of a 
30 given gene was analyzed. Further, the tissue in which 
the differential expression was observed is noted, as 
is the difference in expression of each gene in the 
experimental (either ob or db animals) versus control 
tissues. "It" indicates that gene expression is 
35 increased ( i.e. , there is an increase in the amount of 
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detectable mRNA produced by a given gene) in 
experimental versus control tissue, while H J H 
indicates that gene expression is decreased ( i.e. . 
there is an decrease in the amount of detectable mRNA 
5 produced by a given gene) in experimental versus 
control tissue. Further, "+ M indicates that gene 
expression was activated in experimental versus 
control tissue, i.e. , mRNA was detectable in 
experimental tissue whereas none was detectable in 
10 control tissue. 

Table 5 also notes whether the gene sequence 
corresponds to a gene which had previously been 
identified, and additionally notes in which figure the 
nucleotide sequence of the given sequence is listed. 



15 

TABLE 5 





Gene 


ob /ob 


db/db 


Prev. 
known 


Seq. 


20 


F4 9 

(SEQ. ID 
NO : 3 4 ) 




+Adipose 


No 


FIG. 14 




murine 


UAdipose 


HAdipose 


No 


FIG. 16 


25 


CS m (SEQ. 
ID 

NO : 3 6 ) 












L31/F74 


UAdipose 


UAdipose 


Yes 


FIG. 8 A- 




(SEQ. ID 








8B 


30 


NO: 16) 












L7/L21 


ItLiver 


flLiver 


Yes 


FIG. 9 




(SEQ. ID . 












NO: 18) 











35 
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L2 9 


U Liver 




Yes 


FIG. 10 




(SEQ . ID 












NO: 20) 










5 


L38 

NO: 22, 
43) 


flLiver 


flLiver 


Yes 


FIG. 11 


10 


L3 7 

(SEQ. ID 
NO: 25 , 
44-45) 


flLiver 




Yes 


FIG. 12 




L34 (SEQ 


flLiver 




Yes 


FIG. 22 


15 


ID 

NO: 57) 











*The mouse C5 sequence was first identified via sequence 
homology. C5 was then subsequently tested in ob and db mice, at 
20 which time it was identified to represent a differentially expressed 
gene sequence . 

In addition to the tissues, listed above in Table 
5, in which the initial differential expression was 
observed, further analysis of the tissue distribution 
25 of gene expression of the differentially expressed 

genes has been conducted. Such an analysis consisted 
of either Northern or RT-PCR studies, or both, as 
described, above, in Section 8.1. The tissue 
distribution information obtained for the above-listed 
30 genes is reported, above, in Table 1 of Section 5.2.1. 

As described above, several of the gene sequences 
identified via the genetic obesity paradigms of the 
invention represent previously unknown genes. These 
include F49 (SEQ ID NO.:34) and C5 (SEQ ID NO.: 36). 
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A putative full length cDNA clone (FIG. 14; SEQ 
ID NO.: 34) corresponding to the entire coding sequence 
of the fat-specific F49 gene has been isolated 
utilizing the techniques described, above, in Section 
5 5.1.1.2. Hybridization of F49 nucleotide sequences to 
genomic DNA of several divergent organisms reveals 
that the F49 gene is conserved in most mammals, 
including monkeys and humans, while the gene appears 
to be absent from chicken and yeast. 

10 The F49 coding sequence predicts a 9 6 amino acid 

protein (SEQ ID NO.:35), shown in FIG 14. The 
sequence strongly suggests that the F49 gene product 
is a secreted protein. Take, for example, the F49 
gene product hydropathy plot depicted in FIG. 15. The 

15 strongly hydrophobic amino-terminal portion of the 

amino acid sequence is highly suggestive of a signal 
sequence characteristic of secreted proteins. 

A putative full-length cDNA clone (FIG. 16A-16B; 
SEQ ID NO.: 36) corresponding to the entire coding 

20 sequence of the murine C5 gene has been isolated 

utilizing the techniques described, above, in Section 
5.1.1.2. The C5 coding sequence predicts the protein 
whose amino acid sequence is shown in FIG. 16A-16B 
(SEQ ID NO.:37) This amino acid sequence bears a 50% 

25 identity to the mouse brown fat uncoupling protein, 
and thus represents a newly discovered brown fat 
uncoupling homologue. In order to further 
characterize the C5 gene product, a C5 cDNA has been 
cloned into yeast and mammalian (pcDNA3; Invitrogen) 

30 expression vectors to test whether the C5 gene product 
exhibits uncoupling activity. 

Additionally, database searches have revealed 
that several of the genes identified via the genetic 
obesity paradigms described herein have previously 

35 been identified (see Table 5) . For example, L31/F74 
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(SEQ ID NO.: 16) represents the gene encoding the mouse 
major urinary protein II (FIG. 8A-8B) ; L7/L21 (SEQ ID 
NO.: 18) represents the gene encoding mouse cytochrome 
c oxidase subunit I (FIG. 9); L29 (SEQ ID NO.:20) 
5 represents the gene encoding mouse testosterone 15-a 
hydroxylase (FIG. 10); L38 (SEQ ID NO.:22, 43) 
represents the gene encoding mouse 24p3, a lipocalin 
family member of unknown function (FIG. 11) ; L37 (SEQ 
ID NO.: 25, 44-45) represents the gene encoding mouse 
10 p6-5, a gene which is 86% homologous to rat 

preproelastase I (FIG. 12); L34 (SEQ ID NO:57; FIG. 
22) represents the mouse homolog of a human gene, 
which will be referred to herein as the "human L34 
gene". The human L34 gene, whose sequence can be 
15 found in Hou et al. (Hou et al. , 1994, In Vitro Cell 
Dev. Biol. Anim. 30A: 111-114, which is inocorporated 
herein by reference in its entirety) was cloned from a 
hepatoblastoma cell line and encodes a leucine-rich 
transmembrane protein. The mouse L34 gene exhibits an 
20 82% identity with the human L34 gene at the nucleotide 
level, and the derived amino acid sequence of the 
mouse L.3 4 gene product exhibits an 86% identity with 
the human L3 4 gene product. 

Several of the previously identified genes which 
25 these studies have demonstrated to be differentially 
expressed in obese versus lean control subjects have 
never before been associated with processes involved 
in body weight regulation, appetite regulation, or 
body weight disorders, such as obesity. Among these 
30 genes are the genes encoding the mouse major urinary 
protein II (L31/F74), mouse testosterone 15-a 
hydroxylase (L29) , mouse 24p3 (L38) , mouse p6-5 (L37) , 
and human and mouse L34 proteins. 



35 
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15 



20 



25 



30 



35 



9. EXAMPLE: IDENTIFICATION OF GENES 

DIFFERENTIALLY EXPRESSED IN 
RESPONSE TO SET POINT PARADIGMS 

In the Example presented in this Section, set 

point paradigms, as described, above, in Section 

5.1.1.1, were utilized to identify gene sequences 

which are differentially expressed and which may 

contribute to body weight disorders and/or may be 

involved in such processes as body weight regulation 

or appetite modulation. 

9.1. MATERIALS AND METHODS 
Set point paradigms : 45 male C57B1/6J mice 8 
weeks of age were received from Jackson labs. The 
animals were randomized into 3 groups of 15 mice each, 
and housed individually on normal mouse chow for 1 
week prior to the initiation of the study. Group 1 
mice (Control) were maintained on ad lib mouse chow 
for an additional five days in order to calculate the 
daily food intake. Group 2 mice (underweight) then 
received a fraction of normal food intake (60-90%) so 
as to reduce and maintain their body weight at 
approximately 80% of control values. Group 3 mice 
(overweight) were given a cafeteria diet so as to 
bring their body weights to 125% of control. The 
three groups of 15 mice each were then sacrificed by 
C0 2 euthanasia and tissues were immediately collected. 
Body weights of the three groups of 15 mice were taken 
at the time of sacrifice. 

Other p rocedures : All other tissue collection, 
RNA isolation, differential display, sequence 
analysis, and Northern procedures performed in the 
experiments described in this Section were as 
described, above, in Section 6.1. RT-PCR quantitative 
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analysis was performed as described, above, in Section 
7.1. 

9.2. RESULTS 

5 Mice, as described, above, in Section 9.1, were 

utilized as part of set point paradigms. The mice 
were weighed at the end of the study, immediately 
prior to sacrifice. 

Upon sacrifice, tissues were collected from the 
10 three groups ( i.e. , Control, Underweight and 

Overweight) and immediately frozen. The tissues 
collected were hypothalamus, liver, small intestine, 
pancreas, stomach, epididymal fat pads, and skeletal 
muscle. RNA was collected from the tissue samples 
15 obtained and was subjected to differential display, as 
described, above, in Section 9.1. 

Utilizing such set point paradigms and 
differential display techniques, gene sequences L57, 
F84 and L31/F74, corresponding to known genes, were 
20 identified, as summarized in Table 6, below. In 

addition to differential expression information, Table 
6 also notes in which figure nucleotide sequences of 
the identified genes are listed. 



TABLE 6 

25 





Gene 


Overweight 


Underweight 


Previously 
known 


Seq. 


30 


L57 (SEQ. ID 
NO:29, 46^*8) 




fl Liver 


Yes 


FIG. 13 




F84 (SEQ. ID 
NO:52, 54) 




fl Adipose 


Yes 


FIG. 21 



35 
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L31/F74 




I Liver 


Yes 




(SEQ. ID 








FIG. 8A-8B | 


NO: 16) 











Database searches have revealed that the L57 (SEQ 
ID NO: 29, 4 6-48) gene sequence corresponds to a 
previously known gene, the gene encoding the mouse 
orphan nuclear hormone receptor (FIG. 13) and the F84 
(SEQ ID NO: 53, 55) gene sequence encodes mouse 
cytochrome p450 IID (FIG. 21). As discussed, above, 
in Section 8.2, the L31/F74 gene sequence encodes the 
mouse major urinary protein II. Interestingly, these 
gene products have never before been associated with 
processes involving body weight or appetite regulation 
or body weight disorders. 

10. EXAMPLE: ISOLATION AND CHARACTERIZATION OF 
■ THE HUMAN C5 GENE. 

Described in the Example presented in this 

Section, is the cloning and characterization of a 

human homologue of the mouse C5 gene. The mouse C5 

gene is a homolog of the mouse brown fat uncoupling 

protein, and, further, as demonstrated, above, in 

Section 8, the gene is differentially expressed in 

obese versus lean littermate controls. Thus, C5 can 

represent a gene whose product is involved in body 

weight disorders, and/or processes involved in body 

weight or appetite regulation. Likewise, human C5 can 

also represent a gene whose product is involved in 

such disorders and/or processes in humans. 

10.1. MATERIALS AND METHODS 
Human C5 was cloned from a human fetal spleen 
library (Stratagene) . The probe used for the 
hybridization was a 0 . 9 kb partial cDNA clone of the 
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mouse C5 gene. Filters used for hybridization were 
NitroPlus 2000 (Micron Separations, Inc.)« 
Hybridization and washing conditions were as per 
manufacturer^ instructions for low stringency 
5 hybridizations, except that the hybridization steps 
were carried out at 42°C. 

Northern analysis was performed as described, 
above, in Section 7.1. 

10 10.2. RESULTS 

The mouse C5 gene sequence was used, in 
conjunction with the methods taught, above, in 
Sections 5.2.1 and 10.1, to isolate a cDNA clone 
corresponding to human C5. The nucleotide sequence of 

15 the human C5 cDNA clone is shown in FIG. 17A-17B 
(bottom line; SEQ ID NO:38). The FIG. 17A-17B 
nucleotide sequence (SEQ ID NO: 38) encodes all but the 
first ten amino acids of the human C5 gene product. 

The human C5 amino acid sequence derived from the 

20 cDNA sequence is also depicted in FIG. 17A-17B (top 
line; SEQ ID NO: 56). Using standard well known 
techniques, the human C5 nucleotide sequence can be 
used to isolate the most 5 • end of the C5 coding 
sequence. Alternatively, in order to obtain a full 

2 5 length C5 protein, a nucleotide sequence encoding the 

first 10 amino acids of the mouse C5 protein may be 
ligated to the 5 f end of the human cDNA sequence 
disclosed herein, such that an entire linear C5 coding 
sequence is generated and a full length C5 protein can 

3 0 be expressed using standard techniques. Further, the 

entire amino acid sequence of such a C5 protein can be 
synthesized by, again, utilizing standard techniques 
well known to those of skill in the art. 

The amino acid sequence of such a C5 protein is 
35 depicted in FIG. 18 (SEQ ID NO: 51). The first ten 
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amino acid residues represent sequence derived from 
the mouse C5 protein while the rest of the amino acid 
residues represent sequence taken from the human C5 
protein. Given the high degree of amino acid 
5 conservation which exists between the mouse and the 

human C5 amino acid sequences, it is expected that the 
protein depicted in FIG. 18 represents a functional C5 
protein . 

A Northern analysis of the tissue distribution of 
10 human C5 mRNA transcripts was performed, as depicted 
in FIG. 19. RNA from brain, heart, placenta, lung, 
liver, muscle, kidney, and pancreas tissues were 
isolated and analyzed for C5 expression. As in the 
mouse, the approximately 1.8 kb human CS transcript 
15 exhibits a complex pattern of tissue distribution, 
with mRNA accumulation appearing to be greatest in 
muscle tissue. 



11 . DEPOSIT OF MICROORGANISMS 
20 The following microorganisms were deposited with 

the Agricultural Research Service Culture Collection 
(NRRL) , Peoria, Illinois, on August 23, 19^4 and 
assigned the indicated accession numbers: 

25 Microorganism NRRL Accession N: . 

famf049a B-21318 
fahsOOBa B-21320 

The present invention is not to be limited in 
scope by the specific embodiments described herein, 
which are intended as single illustrations of 
individual aspects of the invention, and functionally 
equivalent methods and components are within the scope 
of the invention. Indeed, various modifications of 
the invention, in addition to those shown and 



30 



35 
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described herein will become apparent to those skilled 
in the art from the foregoing description and 
accompanying drawings. Such modifications are 
intended to fall within the scope of the appended 
5 claims. 



10 



15 



20 



25 



30 
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International Application No: PCT/ / 



MICROORGANISMS 

Option*! Sheet in connection with the microorganism referred to on page 127 . lines 20-36 of the description * 



A. IDENTIFICATION OF DEPOSIT ' 

Further deposits are identified on an additional sheet 



Name of depositary institution * 

AfricultursJ Research Culture CoDecUon (NRRL) 
International Depositary Authority 



Address of depositary institution (including postal code and country) 
1815 N. University Street 
Peoria, IL 61604 
US 



Date of deposit ' August 23. 1994 Accession Number ' B 21318 



B. ADDITIONAL INDICATIONS fU.^ blxnk if no. applicable). ThU in/arm*™ u continued on . 



C. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE * « ■ , , 



D. SEPARATE FURNISHING OF INDICATIONS * Oe*~bm* if «» 



The indication* Ustad b*4ow will be eubmlnad to tha international Buruu later * (Specify the gmril natura of tha indicationa a o 
Accetslon Number of Deposit") 



E.^Thts sheet was received with the IntemarionaJ application when filed (to be checked by the receiving Office) 

v^/Autriorized Officer) 
□ The date of receipt (from the applicant) by the Intenutional Bureau • 



was 



(Authorized Officer) 

r-orm PCT/HO/134 (January 198l) ' 
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International Application No: PCT/ / 



Form PCT/RO/1 34 (com.) 

Agricultural Rasa arch Culture Collection (NRRLJ 
International Depositary Authority 

1815 N. University Street 

Peoria, tL 61604 

US 



Accession No, Date of Deposit 

B-21320 August 23, 1994 
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WHAT IS CLAIMED IS; 

1. An isolated nucleic acid molecule which 
encodes a gene that is differentially expressed in 

5 body weight disorders, said gene containing the 
following nucleotide sequence: 

F49 (SEQ ID NO. : 34) , 

C5 (SEQ ID NO . :36) , or 

human C5 (SEQ ID NO.:38), 
10 or the nucleotide sequence of a gene or gene fragment 
contained in a cDNA within the following E . coli clone 
as deposited with the NRRL: famf04 9a (NRRL Accession 
No. B-21318) , or fahs005a (NRRL Accession No. B- 
21320) . 

15 

2. An isolated nucleic acid molecule which 
encodes a gene that is differentially expressed in 
body weight disorders, which hybridizes under 
stringent conditions to the nucleotide sequence of 

2 0 Claim 1 or its complement, or to the gene or gene 

fragment contained in the cDNA of the E . coli clone of 
Claim 1 as deposited with the NRRL. 

3. An isolated nucleic acid molecule which 

25 encodes: an amino acid sequence depicted in FIG. 14 

(SEQ ID NO:35), FIG. 16A-16B (SEQ ID NO:37), FIG. 17A- 
17B (SEQ ID NO:56) or FIG. 18 (SEQ ID NO:51); an amino 
acid sequence encoded by the nucleotide sequence of 
Claim 1 or its complement, or an amino acid sequence 

30 encoded by the gene or gene fragment contained in the 
cDNA of the E. coli clone of Claim 1 as deposited with 
the NRRL. 

4. A nucleotide vector containing the 
5 nucleotide sequence of Claim 1, 2 or 3 . 
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5. An expression vector containing the 
nucleotide sequence of Claim 1, 2 or 3 in operative 
association with a nucleotide regulatory element that 
controls expression of the nucleotide sequence in a 

5 host cell . 

6. A genetically engineered host cell 
containing the nucleotide sequence of Claim 1, 2 or 3 . 

10 7. A genetically engineered host cell 

containing the nucleotide sequence of Claim 1, 2 or 3 
in operative association with a nucleotide regulatory 
element that controls expression of the nucleotide 
sequence in the host cell. 

15 

8 . An isolated gene product encoded by the 
nucleic acid molecule of Claim 1, 2, or 3. 

9. The isolated gene product of Claim 8 wherein 
20 the gene product is a protein comprising amino acid 

sequence SEQ ID NO: 35, SEQ ID NO: 37, SEQ ID NO: 51 or 
SEQ ID NO: 56 . 

10. An antibody that immunospecif ically binds 
2 5 the gene product of Claim 8. 

11. A method for diagnosing body weight 
disorders, comprising detecting, in a patient sample, 
a gene transcript or its gene product which is 

30 differentially expressed in a body weight disorder 
relative to a non-body weight disorder. 

12. The method of Claim 11 in which the gene is 
induced in genetically obese individuals. 

35 



- 132 - 



WO 96/05861 



PCT/US95/10918 



13 . The method of Claim 12 in which the gene 
encodes stearoyl -CoA-desaturase , cytochrome c oxidase 
subunit I, a 24p3 protein, a p6-5 protein, a homolog 
of mouse brown fat uncoupling protein, an F4 9 protein 

5 a C5 protein, or an L34 protein. 

14. The method of Claim 11 in which the gene is 
repressed in genetically obese individuals. 

10 15. The method of Claim 14 in which the gene 

encodes a homologue of major urinary protein II or a 
testosterone hydroxylase protein. 

16. The method of Claim 11 in which the gene is 
15 induced by fasting. 

17 . The method of Claim 16 in which the gene 
encodes cytochrome c oxidase subunit 1, glutamine 
synthetase, islet regenerating protein, or alpha - 

20 amylase. 

18. The method of Claim 11 in which the gene is 
repressed by fasting. 

25 19. The method of Claim 18 in which the gene 

encodes an H27 protein or a homologue of rat 
cytochrome p4 5 0 enzyme. 

20. The method of Claim 11 in which the gene is 
30 repressed by refeeding of a fasted individual. 

21. The method of Claim 11 wherein the gene is 
induced by refeeding of a fasted individual. 

35 
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10 



22. The method of Claim 11 in which the gene is 
induced in underweight individuals. 

23. The method of Claim 22 in which the gene 
encodes an orphan nuclear hormone receptor protein, a 
mouse major urinary protein II or an F84 protein. 

24 . The method of Claim 11 in which the gene is 
repressed in underweight individuals. 

25- The method of Claim 11 in which the gene is 
induced in overweight individuals. 

26. The method of Claim 11 in which the gene is 
15 repressed in overweight individuals. 

27. A method for treating a body weight disorder 
in a mammal, comprising administering to the mammal a 
compound that modulates the synthesis, expression or 

20 activity of a target gene or target gene product so 
that symptoms of the body weight disorder are 
ameliorated . 

28. The method of Claim 27 in which the compound 
25 inhibits the synthesis, expression or activity of the 

target gene or the target gene product. 

29. The method of Claim 27 in which the compound 
is an antisense or ribozyme molecule that blocks 

30 translation of the target gene. 

30. The method of Claim 27 in which the compound 
is complementary to the 5' region of the target gene 
and blocks transcription via triple helix formation. 

35 
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31. The method of Claim 27 in which the compound 
is an antibody that neutralizes the activity of the 
target gene product . 



WO 96/05861 



1/22 



PCT/US95/10918 




SUBSTITUTE SHEET (RULE 26) 



WO 96/05861 



2/22 



PCTYUS95/10918 



C ob C db 



12 3 4 

FIG. 2 



SUBSTITUTE SHEET (RULE 26) 



WO 96/05861 



PCT/US95/10918 



3/22 



10 20 30 40 

■ ■ ' ■ 1 ■ ' ■ 1 1 ' * i ■ 1 1 ■ ■ ■ 1 ■ 1 ■ ■ 1 ■ ■ ■ ■ 1 ' ■ ■ ■ 1 ' ■ i ■ 1 

GGGCAGTACAACCAGATCCACTTTTATTAGGAACAAATAC 40 

AATCTCAATCAGTACAAGTAGGCTTCAAGAGTTGATATTA 80 

ATGGAAATCATCCAAATTACACTTGGGTCACAAATAATTA 120 

CCCCACATAAAAAGGGAAAAAAAAAoTCTCATTCAGGGgA 160 

AGGG AAAGG T T TCCTGC AATGG T T T TCATGGCAGTGGG T A 200 



210 220 230 240 

* » ' ' I » i t i I it i i I t t i i t i i t i 1 i i i i I tiii i i i i i 1 

GG TAGTC T TGC AC T T T gGAC TGG TCAT ATCTG TCAG TC TC 240 
TGGGCAGAGCAAA 253 



FIG.3A 



157 CCTGAATGAGATTTTTTTTTTCCCTTTTTATGTGGGGTAATTATTTGTGACCCAACTCTA 98 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
3505 CCTGAATGAGATTTTTTTTTTCCCTTTTTATCTGGGCTAATTATTTCTCACCCAAC1CTA 3564 

97 AT T TGGATGAT T TCCAT T AAT ATCAACTCT TGAAGCC T AC T TG T AC I GAT T G AGA T TCI A 38 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIinillllllllllll 
3565 ATTTGGATGATTTCCATTAATATCAACTCTTGAAGCCTACTICIACTCAITCACAHCTA 3624 

37 TTTGTTCCTAATAAAAGTGGATCTGGTTGTACTGCC 2 

llllllllllllllilllllllllllllllllll I 
3625 TTTGTTCCTAATAAAAGTGGATCTGGTTGTACTGTC 3660 



217 CAAAGTGCAAGAC TACCTACCCACTGCCATGAAAACCAT TGCAGGAAACC T T TCCC T TCC 158 

inillllllllllllllllllllllllllllllllllllllllllllllllllllllli 
3447 CAAAGTGCAAGACTACCTACCCACTGCCATGAAAACCATTGCAGGAAACCTT TCCCT TCC 3506 

157 CCTGAATGAGATTTTTTTTTTCCCTTTTTATGTGG 123 

II llllllli I III I II II 
3507 TGAATGAGATTTTTTTTTTCCCTTTTTATGTGGGG 3541 



FIG.3B 

SUBSTITUTE SHEET (RULE 26) 



WO 96/05861 



PCT/US95/10918 



4/22 



CO 

o 

co csi 




SUBSTITUTE SHEET (RULE 26) 



WO 96/05861 PCI7US95/10918 



5/22 



69 ATCTCTTTTGGCAGAACATGAATGCAGGTCACCTGGTGTCAATACTCAGCCAGGCTGAGA 128 

i muni iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiuiiiiiiii 

1848 AGCTCTTTTGCCAGAACATGAATGCAGGTCACCTGGTGTCAATACTCAGCCAGGCTGAGA 1907 
129 GCAACTTGGTGGCCTCGCTGGTTAAGGAGAGTGGTACTACAGCTTCCAATGTCTGGACTG 188 

mini iiiiiiiiiiiniiiiiiiiiniiiiiiiiiiiiiiiiiiiiiiiiiiii 

1908 GCAACTTTGTGGCCTCGCTGGTTAAGGAGAGTGGTACTACAGCTTCCAATGTCTGGACTG 1967 

189 GACTTCATGACCCTAAAAGT 208 

llllllllllllllllllll 
1968 GACTTCATGACCCTAAAAGT 1987 



209 AACCGTCGNTGGCACTGGAGCAGTGGCTCCCTATTTCTCTTCAAGTCATGGGCCACTGGA 268 

llllllll illlllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
2979 AACCG TCGT TGGCAC TGGAGCAG TGGCTCCCT AT T TCTC T TCAAG TCATGGGCCACTGG A 3038 

269 GCTCCAAGCACTGCCAACCGGGTT 292 

millllllllMMIIII I I 
3039 GCTCCAAGCACTGCCAACCGTGGT 3062 



8 AACGCCTATGGTTCCTACTGTTATTATCTAATTGAAGACCGCTTGACCTGGGGGGAGGCT 67 

llllllllllllllllllllllimillllllllllim IMIIIMIMMMMI 
1563 AACGCCTATGGTTCCTACTGTTATTATCTAATTCAAGACCGTTTGACCTGGGGGGAGGCT 1622 

68 GATCT 72 

III I 
1623 GATGT 1627 



318 AGGATACAAAAMTGGGAGGACGAAAACTGTGAGGCACAGTACTCCTTGGTCTTGAAGTT 377 

II lllllllllllll lllllllllllllllllllllllllllllll Mil lllll 
3279 AGC AT ACAAAAAATGG AAGG ACG AAAAC TG TG AGGCACAG TACTCCTTTGTC TGCAAG T T 3338 

378 CAGAGGCTAA 387 

lllll llll 
3339 CAGAGCCTAA 3348 
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2 GGCTTAGTCAGCCACTTTACAGACTGGTCTTCCTGCTGGCACATACTGTGATGTCATCTC 61 



1 355 GGCTTTGTCAGCCACTTTACAGACTGGTCTTCCTGCTGGCACATACTGTGATGTCATCTC 1414 
62 TGGAGATAAGGTCGATGGCMTTGCACTGGACTTAGAGTGMTGTTGGCAGTGATGGCAA 121 



1415 TGG AG AT AAGG TCG ATGGCAAT TGC AC TGG AC TTAGAGTGAATGT TGGC AG TG ATGGC AA 1474 
122 AGCTCACTTTTCCATTAGTAACTCTGCTGAGGACCCATTTATTGCMTCCATGCTGACTC 181 



1 475 AGCTCACTTTTCCATTAGTAACTCTGCTGAGGACCCATTTATTGCAATCCATGCTGACTC 1 534 
182 AAAATTGTAAGAATCTATATTAAAGAGATTTGGATTAGGAA 222 
1535 AAAATTGTAAGAATCTATATTAAAG^ 1 575 

FIG.6 

24 AGAGTCAAGGGCTAGTGCGCACCGCAGCCAGCGCCCAGTACCCTGGCGTTCTGGGTACCA 83 

III I I Ml II IN I II I ll |( II II 1 1 HI H I 

247 AGATCCAAGGCGAGTTCCCGATCACCAGCGGCATCACCTACAAAGGTGTCC1GGGGACAA 306 

84 TCCTAACCATGGTGCGCACTGAGGGTCCACGCAGCC TC T ACAATGCGC 1GC T CCCCGCCC 143 

307 TCACCACCCTGGCAAAAACGGAAGGGCCCCTGAM 1CCCCCCCCCCC 366 

144 TGCAGCGCCAGATGAGCCT TGCCTCCGTCCGCAT ^GCC | ^ J ^ ' M ^^^^^ | 203 

367 TCCAGAGACAAATCAGCT TCGCCTCG^ 1 C I ACGACACCC 1 CCAGGAG T 426 

204 TCTACACCMGGGCTCAGAGCATGGAGGCATCGGGAGCCGCCTCCTGGCAGGTAGCACCA 263 

427 TCTTCACCTCGGGGGAAGAAACACCGAGTTTAGGAAGCAAGATCTCGGCCGGCCTAACAA 486 

264 CAGGTGCCCTGGCCGTGGTTGTAGCCCAGCCTACAGATGTGGTAAAGGTCCGCTTCCAGG 323 



I II I I mi iii i ii him inn ii ii ii m i i ii i „, c 

487 CTGGAGGCGTGGCGGTGTTCATCGGGCAGCCCACAGAGGTCGTGAAAGTCAGGCTGCAAG 546 



324 CTCCAGGCC 332 

I I III 
547 CGCAGAGCC 555 
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42 AAGATGCTGCTGCTGCTGCTGTGTTTGGGACTGACCCTAGTCTGTGTCCATGCAGAAGM 101 

I II I lllllllllllllllllllinillllllllllllllllllllllllllllll 

59 ATGAAGATGCTGCTGCTGCTGTGTTTGGGACTGACCCTAGTCTGTGTCCATGCAGAAGAA 118 

102 GCTAGTTCTACGGGAAGGAACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATT 161 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

119 GCTAGTTCTACGGGAAGGAACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATT 178 

162 ATCCTGGCC TC TGACAAAAGAGAAAAG AT AGAAG AT AATGGCAACTT T AGAC TTTTTCTG 221 

IIMIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIilllllllllllll 

179 ATCC TGGCC TC TGACAAAAGAG AAAAGATAGAAGAT AATGGCAACT T T AGAC T T T T TC TG 238 

222 GAGCAAATCCATGTCTTGGAGAAATCCTTAGTTCTTAAAATCCATCCTGTAAGAGATGAA 281 

llllimillllllimilllllllllllllllllM Hill lillllllllllll 

239 GAGCAAATCCATG TCT TGGAGAAATCCT TACT TCTTAAAT TCCATACTGT AAGAGATGAA 298 

282 GAGTGCTCCGAATTATCTATGGTTGCTGACAAAACAGAAAAGGCTCGTGAATATTCTGTG 341 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

299 GAG TGCTCCGAAT T ATCTATGGT TGCTGACAAAACAGAAAAGGCTGGTGAAT ATTCTG TG 358 

342 ACGTATGATGGATTCAATACATTTACTATACCTAAGACAGACTATGATAACTTTCTTATG 401 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

359 ACGTATGATGGATTCAATACATTTACTATACCTAAGACAGACTATGATAACTTTCTTATG 418 

402 GCTCATCTCATTMCGAAAAGGATO^AAACCTTCCAGCTGATGGGGCTCTATGGCCGA 461 

lllllfllllllllllllllllllllllllllllllllllllllllllllllllllllll 

419 GCTCATC TCAT T AACGAAAAGGATGGGGAAACCT TCCAGCTGATGGGGC TC T ATGGCCGA 478 

462 GMCCAGATTTGAGTTCAGACATCAAajAAAGGTTTGCACAACTATGTGAGGAGCATGGA 521 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII llllllllllllllllllll 

479 GAACCAGATTTGAGTTCAGACATCAAGGAAAGGTTTGCAAAACTATGTGAGGAGCATGGA 538 

522 ATCCTTAGAGAAAATATCATTGACCTATCCAATGCCAATCGCTGCCTCCACGCCCGAGAA 581 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIII 

539 ATCCT TAGAGAAAATATCAT TGACCTATCCAATGCCAATCCC TGCC TCCAGGCCCGAGAA 598 

582 TGAAG AT TGGCC TGAGCCTCCAG TG T TG AG TGG AG AC T TCTCACCAGGACTCCACC ATCA 641 

linn iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiin 

599 TGAAGAATGGCCTGAGCCTCCAGTGTTGAGTGGAGACTTCTCACCAGGACTCCACCATCA 658 
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642 TCCCTTCCTATCCATACAGCATCCCCAGTATAAATTCTGTGATCTGCATTCCATCCTGTC 701 

659 TCCCTTCCTATCCATACAGCA^ 7,8 
702 TCACTGAGAAGTCCAATTCCAGTCTATCCACATGTTACCTAGGATACCTCATCAAGAATC 761 

719 IcACTGAGAAGTCCMTTCCAGW 778 
762 AAAGACTTCTTTAAATTTCTCTTTGATATACCCATGACAATTTTTCATGAATTTCTTCCT 821 

779 AAAGaM IMIIaAATT TcIMtTGAT AT ACC^TGACAATT T ItCATGAAT llcl IrcT 838 

822 CT TCCTG T TCAAT AAATGAT T ACCCT TGCACT T A 855 

839 CTTCCTGTTCAATAAATGATTACCCTTGCACTTA 872 

FIG.8B 



282 TGACCCGTACTTATTACAGCCGTACTGCTCCTATTATCACTACCAGTGCTAGCCGCAGGC 223 

5883 TCATOXTACTtW 5942 

222 ATTACTATACTACTAACAGACCGCAACCTAAACACAACTTTCTTTGATCCCGCTGGAGGA 163 

5943 aIIaCTATACUCT^ 6002 

162 GGGGACCCAATTCTCTACCAGCATCTGTTCTGATTCTTTGGGCACCCAGAAGTTTATATT 103 

6003 GGGGACCCAATTCTCTACCAGCATCTGTTCTGATTCTTTGGGCACCCAGAAGTTTATATT 6062 

102 CTTATCCTCCCAGGATTTGGAATTATTTCACATGTAGTTACTTACTACTCCGGAAAAAAA 43 

6063 cnATCCTCCCAGGATTTGGM^ 6122 

42 GAACCTTTCGGCTATATAGGAATAGTATGAAAAAAAA 6 

6123 GAACCT T IMgCTAT AT AGGaItAGTATGAGCAAT AA 6159 
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1 TTGTTGCGATCCCACCAACCTACACTATGAGTTTCTTGTCCCGTTGATCCTGGGCTGCAT 60 

Nil llllllllllllllllllllllllllllllllllllllllllllllllllllll 
237 TTGTCACGATCCCACCAACCTACACTATGAGTTTCTTGTCCCGTTGATCCTGGGCTGCAT 296 

61 GAGGTTAAAGOJMTGATTGAGACCAGA^TCACG^^ 120 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

297 GAGGTTAAAGGGAATGATTGAGACCAGACAAGTCAGGGGTTGAAACTTAGAAAAGGTCAA 356 
121 AGG T ACAG AAGAAACAGAGGACAC T TCG T AGACT TGCAGAGGAT AT T TCAAAGGT AGCCA 180 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 

357 AGG TACAGAAGAAACAG AGGACACT TCGTAG ACT TGCAGAGGAT ATT TCAAAGGT AGCCA 416 
181 GAGAAGGGGGAMTTATACTATGTTGTCMTAGGAATAATAAAATAATAAAAGTAGATAT 240 

IIIMIIIHIIIIIIIIIIMIIIIIIIMM MlltlMIIII MMIIIIIIM 

417 GAGAAGGGGGAAATTATACTATGTTGTCAATAGGAATAATAAAATAATAAAAGTAGATAT 476 
241 TATTTATGGAA 251 

Ilillllll I 

477 TATTTATGGCA 487 



FIG.10 



237 GG TKG T TGAG TG TGGC TGAC TGGC ATGCGCAGAGACCCAATGG T TCAGGCGC TGCC TG TC 178 

II I llllllllllllllllllllllllllllllllllllllllllllllilllllll 
628 GGGTGGTCAGTGTGGCTGACTGGGATGCGCAGAGACCCAATGCTTCAGGCGCTGCCTGTC 687 

177 TCTCTGCCACTCCATCTTTCCTGTTGCCAGAGAGCCACCTGGCTGCCCCACCACCCACCA 1 18 

llllllllllllllllllillllllllllllllllllillllllllllllllllllllll 
688 TG TC TGCCAC TCCATCT T TCCTG T TGCCAGAGAGCCACC TGGC TGCCCCACCAGCC ACC A 747 

117 TACCAAGGAGCATCTGGAGCCTCTTCTTATTTGGCCAGCACTCCCCATCCACCTGTCTTA 58 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
748 T ACCAAGGAGCATCTGGAGCCTCTTC T T AT T TGGCCAGCACTCCCCATCCACCTGTC T T A 807 

57 AC ACCACCAA TGGCG TCCCC T T TC TGCTG AA TAAAT ACATGCCCCC 12 

illlllllllllllllllllllllllMIIIIIIIIIIIIIIIIII 
808 ACACCACCAATGGCG TCCCCT T TCTGCTGAATAAAT ACATGCCCCC 853 



263 CCGACCAATGCATTGACAACTGAATGGGTKGT 232 

Illllllllllllllllllllllllllll II 
603 CCGACCAATGCATTGACAACTGAATGGGTGGT 634 
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1 TCCGCTCTGGATGCCAGGGTGATTCTGGGGGACCCCTCCACTGCATGGTGAACGGTCAGT 60 

527 TCCGCTCTGGATGCCAGGGTGATTCTGGGGGACCCCTCCACTGCATGGTGAACGGTCAGT 586 

61 ATGCTGTCCACGG ^ TGACC ^ 120 

587 ATGCTG TCGACGGAGTGACCAGCT T TGTG TCCAGCATGGGCTG T AATGTCGCCAGG AAGC 646 

121 CCACCGTCTTCACCAGAGTCTCTGCTTACATTTTC 155 

647 CCACCGTCT TCACCAGACTCTCTGCTT ACAT T TCC 681 

315 AGCCTTGGGCTCCATCCCTAATACTGCAACAGGAGCAGGGGAATGCTGCTGGTGTCTTGG 374 

839 AGCCTTG^CTCCATCCCT^ 898 

375 TATCTGGGGCAAAGGTGGGGGG 396 

899 TATCTGGGGCAAAGGTGGGGGG 920 

391 GGGGGGGGTTCATGAAAAGCAACTCAGACTACTGAATCAGATACAGAAAGGCAAATAAAA 450 

913 GTCGGGTCTTAATGAAAAGCMCTCAGACTACTGAATCAGATACAGAAAGGCAM 972 

451 ATCAATGTGTTA 462 
973 ATCAATGTGTTA 984 
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39 GGGGCAGGGGATCTGCTCAGCTCTATGTTTGAGTTCAGTGAGAAGCTGAATGCCCTCCAG 98 

II illllllllllllllllllllllllllllllllllllllillllllfllllllllll 
1321 GGAGCAGGGGATCTGCTCAGCTCTATGTTTGAGTTCAGTGAGAAGCTGAATGCCCTCCAG 1380 

99 CTCAGTGATGAGGAAATGAGCTTGTTCACAGCAGTTGTTCTGGTATCTGCAG 150 

lllllllllllllllllllllllllllllllllllllllllllllillllll 
1381 CTCAG TGATGAGGAAATGAGC TTGTTCACAGCAGT TGT TCTGG TATC TGCAG 1432 



151 GATCGATCTGGAAT TGAAAATGTCAACTCAGTGGAGGCT T TGCAGGAAACACTCATCCG T 210 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
1432 GATCGATCTGGAATTGAAAATGTCAACTCAGTGGAGGCTTTGCAGGAAACACTCATCCGT 1491 

211 GCACTAAGGACCTTAATAATGRAAAAACCATCCAAATGAGGCCTCCATTTTTACAAAAT 269 

lllllllllllllllllllll INI I I II III Nil III I 
1492 GCACTAAGGACCTTAAT AATGAAAAACCATCCAAATGAGGCCTCCAT T T T T ACAAAAT 7 1550 



233 AAAAACCATCCAAATGAGGCCTCCATTTTTACAAAATTACTTCTAAAGTTGCCAGRTCTT 292 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII III! 
1513 AAAAACCATCCAAATGAGGCC TCCAT TT T TACAAAAT T AC T TCT AAAG T TGCCAG ATCT T 1572 

293 CGATCTTTAAACAACATGCACTC 315 

lllllllllllimilllllll 
1573 CGATCTTTAAACAACATGCACTC 1595 



4 GGTAAGAAGTACAGTGTGGATGACCTGCACTCAATGGG 41 

iiiumiiiiiiiiiiiiiiiiiimiiimiii 

1285 GGT AAGAAGT ACAG TG TGGATG ACC TGCAC TC AATGGG 1322 
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CGC TGT CTC GGG 



AGG TCT GAA GAC 



61/21 
GTG GCA 
vol ola 
121/41 
AGG AAG 
org lys 
181/61 
TGT GTT 
cys vol 
241/81 
AAA CTC 
lys leu 
301/101 
ACC ATC 
thr ile 
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met 
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lys 
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asn glu 
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GAT GGT 
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osn lys 
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pro lys 

ATA TCT 
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lys his 

CAA GAC 
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TGA 
CPA 



ATG AAC CAC TTG TTC 



31/11 

CTG CTT CAG 
leu leu gin 
91/31 

AGA TGT TTT 
org cys phe 
151/51 

GAG ATG GCA 

glu met gly 

211/71 

AAG GAG CAC 

lys glu his 

271/91 

TAT ATG ATC 

lyr met i le 

331/111 

ACG AAG GCC 



421/141 

AAT AAA GTT CGA 



ACT GGG CTG CTT CTG TCC 
CCC TTT TGG CTG 



391/131 
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START 



1/1 31/11 
otq gtl ggt Uc oog gcc oca got gig ccc ceo oco gec oct gig oog tic clg ggg get 
Met vol gly phe lys olo thr osp vol pro pro thr olo thr vol lys phe leu gly olo 
6T721 ' 91/31 

ggg oco get gcc tgc ott gco got etc otc oct ttc cct ctg got occ gcc oog gle egg 
gly thr olo olo cys i I e ola osp leu ile thr phe pro leu osp thr olo lys vol org 
121/41 151/51 

ctg cog otc coo ggg gog ogl coo ggg cto gtg cgc occ gca gcc age gcc cog toe cgt 
leu gin ile gin gly glu ser gin gly leu vol org thr ola ala ser olo gin tyr org 
181/61 ' J ' 211/71 

ggc gtt ctg ggt ace otc cto occ otg gtg cgc act gog ggt ceo cgc ogc etc tac ool 
gly vol leu gly thr ile leu thr met vol org thr glu gly pro org ser leu tyr osn 
241/81 271/91 

ggg ctg gtc gcc ggc ctg cog cgc cog otg ogc ctt gcc tec gtc cgc ott ggc etc toe 
gly leu vol olo gly leu gin org gin met ser leu olo ser vol org ile gly leu tyr 
301/101 ' ' ' 331/111 

goc let gtc ooo cog ttc toe occ oog ggc tea gog col ggo ggc otc ggg ogc cgc etc 
osp ser vol lys gin phe tyr thr lys gly ser glu his gly gly ile gly ser org leu 
361/121 391/131 

clg gco ggt ogc occ oco ggt gcc ctg gcc gtg gtt glo gcc cog eel oco got gtg gto 
leu olo gly ser Ihr thr gly olo leu olo vol vol vol olo gin pro thr osp vol vol 
421/141 451/151 

oog gtc cgc Itc cog gel ceo ggc egg gel ggt ggt ggt egg ogo toe oga gco clg teg 
lys vol org phe gin olo pro gly org olo gly gly gly org org lyr org olo leu ser 
481/161 J 511/171 

ogc toe oog ooc otc ocg ogo ggo ggg otc egg ggc etc tgg oog ggo etc tec coo tgt 
ser tyr lys osn ile thr org gly gly ile org gly leu trp lys gly leu ser gin cys 
541/181 ' 571/191 

gcc cgt ool gcc oil gtc ooc tgt get gog ctg gtg occ tot gac etc ate ooo got ocl 
olo org osn olo ile vol osn cys olo glu leu vol thr lyr osp leu ile lys osp thr 
601/201 631/211 

etc ctg age cac etc otg oca got goc etc cct tgc coc Itc oct tct gcc tic ggg gcg 
leu leu ser his leu mel thr osp osp leu pro cys his phe thr ser olo phe gly olo 
661/221 691/231 

ggc ttc tgc occ occ gle otc gcc lee cct gtg got gtg gle oog ocg ogo toe otg oct 
gly phe cys Ihr Ihr vol ile olo ser pro vol osp vol vol lys thr org tyr met thr 
721/241 751/251 

ctg ctg ggc cog toe cac age gco ggt coc tgt gcc ctt aco tgc teg gag gag ggo ccc 
leu leu gly gin tyr his ser olo gly his cys ola leu thr cys ser glu glu gly pro 
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781/261 
gcg etc ttc 
olo leu phe 
841/281 
atg tit gtc 
met phe vol 
901/301 
gco cct ttc 
ola pro phe 
961/321 
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oac cog ggg gtt otg cct 
osn gin gly vol met pro 

occ tot gog cag etc coa 
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1141 

ccc ogc ceo 
1201 

000 00 
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tea gee ctt tct aca gel 
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811/271 
tec ttt etc cgc 
ser phe leu org 

871/291 
ogo gec do otg 
org olo leu met 

931/311 
cct ggo ccc tgc 

991 

etc ttc cca tit 
1051 

got goc tco cog 
1111 

oog ccc oca let 
1171 

cot ooo gco age 



tig ggo tec tgg ooc gto gtg 
leu gly ser trp asn vol vol 

get gec loc coo tct egg gag 
olo olo tyr gin ser org glu 

ttc ceo gee ctg ccc tgt ctt 



cct gco ccc ego ttl ocl tec 



tgo ggo ggc clg oeo ceo goc 



tco tct tco tec ceo gec cog 



tco otg tto ooo ooo ooo aoo 
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PTATVKFLGAGTAACIA 27 
CCT ACT GCC ACT GTG AAG TTT CTT GGG GCT GGC ACA GCT GCC TGC ATC GCA 60 

DIITFPLDTAKVRLQIQGES 47 
GAT CTC ATC ACC TTT CCT CTG GAT ACT GCT AAA GTC CGG TTA CAG ATC CAA GGA GAA ACT 120 

OGPVRATVSAQYRGVMGTIL 67 
CAG GGG CCA GTG CGC GCT ACA GTC AGC GCC CAG TAC CGC GGT GTG ATG GGC ACC ATT CTG 180 

TMVRTEGPRSLYNGLVAGLQ 87 
ACC ATG GTG CGT ACT GAG GGC CCC CCA AGC CTC TAC AAT GGG CTG GTT GCC GGC CTG CAG 240 

R0MSFASVRIGLYDSVKQFY107 
CGC CAA ATG AGC TTT GCC TCT GTC CGC ATC GGC CTG TAT GAT TCT GTC AAA CAG TTC TAC 300 

TKGSEHAS1GSRLLAGSTTG127 
ACC AAG GGC TCT GAG CAT GCC AGC ATT GGG AGC CGC CTC CTA GCA GGC AGC ACC ACA GGT 360 

ALAVAVAQPTDVVKVRFQAQH7 
GCC CTG GCT GTG GCT GTG GCC CAG CCC ACG GAT GTG GTA AAG GTC CGA TTC CAA GCT CAG 420 

AR AG GG RRYQST VN AYK T 1 A 167 
GCC CGG GCT GGA GGT GGT CGG AGA TAC CAA AGC ACC GTC AAT GCC TAC AAG ACC ATT GCC 480 

RE FGFRGLWKGTSPNVARNA187 
CGA GAG GAA GGG TTC CGG GGC CTC TGG AAA GGG ACC TCT CCC AAT GTT GCT CGT AAT GCC 540 

IV NCAELVTY0LIKDALLKA207 
ATT GTC AAC TGT GCT GAG CTG GTG ACC TAT GAC CTC ATC AAG GAT GCC CTC CTG AAA GCC 600 

NIMTOOLPCHFTSAFGAGFC227 
AAC CTC ATG ACA GAT GAC CTC CCT TGC CAC TTC ACT TCT GCC TTT GGG GCA GGC TTC TGC 660 

TTV1A SPVDVVKTRNMNSAL247 
ACC ACT GTC ATC GCC TCC CCT GTA GAC GTG GTC AAG ACG AGA AAC ATG AAC TCT GCC CTG 720 

roYSSAGHCALTMLQKEGPR267 
GGC CAG TAC ACT AGC GCT GGC CAC TGT GCC CTT ACC ATG CTC CAG AAG GAG GGG CCC CGA 780 
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AFYKGFMPSFL 
GCC TTC TAC AAA GGG TTC ATG CCC TCC TTT CTC 

FGTYEQIKRAL 
TTC GGC ACC TAT GAG CAG CTG AAA CCA GCC CTC 

P F « 

CCC TTC TGA GCC TCT CCT GCT GCT GAC CTG ATC 
ATG CTT TCC TTT TCT TCC TTC TTT CTC TTC CCT 
CCA CCT CTT CCT TCC GCT CCT TTA CCT ACC ACC 
CAT TGT CTC AGT GCT GGT GGA CTT GAC ATT TGA 
GAT CCC AAG CGT CCC GTC CCT TGG AAA GTT CAG 
CCC AGC CTA GCC CAC TTG TCA TCC ATA AAG CAA 
AAAA 1264 



RLGSWDVVM 287 

CGC TTG GGT TCC TGG GAC GTG GTG ATG 840 

MAACTSREA 307 

ATG CCT GCC TCC ACT TCC CCA GAG GCT 900 

309 

ACC TCT GCC TTT CTC TCT AGC CGG GCC 960 

CCT TCC CTT CTC TCC TTC CCT CTT TCC 1020 

TTC CCT CTT TCT ACA TTC TCA TCT ACT 1080 

CAG TGT GGG AGC CCT CCT ACC ACC CAC 1140 

CCA GAA TCT TCC TCC ICC CCC CCA CAG 1200 

CCT CAA CCI TCA AAA AAA AAA AAA AAA 1260 
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MVGFKATDVP PTATVKFLGA GTACIADLIT 


FPLDTAKVRL QIQGESQGPV 


50 


RATVSAOYRG VMGTILTMVR TEGPRSLYNG 


LVAGLQRQMS FASVRIGLYD 


100 


SVKQFYTKGS EHASIGSRLL AGSTTGALAV 


AVAQPTDWK VRFQAQARAG 


150 


GGRRYQSTVN AYKTIAREEG FRGLWKGTSP 


NVARNA 1 VNC AELVTYOLIK 


200 


DALLKANLMT DDLPCHFTSA FGAGFCTTV1 


ASPVDWKTR NMNSALGQYS 


250 


SAGHCALTML QKEGPRAFYK GFMPSFLRLG 


SWDWMFGTY EQLKRALMAA 


300 


CTSREAPF 




308 
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108 GATCCCTGGCCAGAAGTACAAAGACACTAACCTGCTAATACTCTTTAAGGAAGATTACTT 167 

IIIIIIIIIMMIIIIillllilllllllllMIIIMIIIIIIIIIMIIIIIIIIII 
647 GATCCCTGGCCAGAAGTACAAAGACACTAACCTGCTAATACTCTTTAAGGAAGATTACTT 706 

168 TGC AAAAAAAAATG AAG AAAG AAAGC AG AGC AAAG TGG AAGC T AAA T T AAAAGC T AAAC A 227 

IIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
707 TGC AAAAAAAAATG AAG AAAG AAAGCAGAGCAAAG TGG AAGC T AAA T T AAAAGC T AAAC A 766 

228 AG AGCATG AAGG AAGAC AC AAGCCAGG AAG T AC TG AAACC 267 

IIIIIMIIIIIIIIIIIIMIIIIIIIIIIIMIMIII 
767 AG AGCA TG AAGG AAGAC AC AAGCCAGG AAG T AC TG AAACC 806 
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189 AGGGTCAGTGGTCTCCAGTCTTCTCCCAGTTTGGGCTTATTGATGTCCAATAAACAAGTT 130 

i m i in it i i in linn ii iiiiiiiiiiiiiiniii ii 

4485 AGGGGCAGGGGCTCCATTCTTCTCCCCGGTTTGGTCTGATTGATGTCCAATAAACAACTT 4544 

129 CTGTGTCTGCAAA 117 

Nil III III! 
4545 CTGTATCTTCAAA 4557 



219 CCCTACCAACTCTGTGCAGTGGTTCGCTAAAGGGTCAGTGGTCTCCAGTCTTCTCCCAGT 160 

II llllllllllllllillll llllll II II lllll lllllllll I 
4454 CCTTACCAACTCTGTGCAGTGGCTCGCTAGCAGGGGCAGGGGCTCCATTCTTCTCCCCGG 4513 

159 TTGGG 155 

II II 
4514 TTTGG 4518 
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1 GGAATTCGGC ACGAGGAACT AGTCTCGAGT AGCTCTGTGT 

41 GTGTAGACAC TCCCTAAGCC ACCCGAGGGC TGCCTCTGTG 

81 TGGTGGGGAG GTAGGAGAGG GGAAGAGGGT TGCTATTCTC 

121 ATTTTATACT TTTCCCACGG CTTCCAACTT TATGCCAATA 

161 ACTCACAATT AAACAGGTCT CAGAACTAAG GGAGCTTTCA 

201 ACACTTGGGC TAACTCAAGT GCAGTGAGCG AACGGGTTCT 

241 CCAAACCTGC TGGGTCCTGG CTACCTATGT CACACAGATG 

281 AGTGAGTCCA CCTGCCTTCC ATTCTCTGAG CTTGTGGTCA 

321 CAGCAGGAGT CTTCCAGGGC ACTGCCTTGG STTGGAAATG 

361 TAAGTCCGTC CGGAAATAGT ATCCCCATCA GCTCTCTGGT 

401 GTTTGTGAGC TCTAGCATGC CATCCAGGTT CCAAGGAGGA 

441 GAGTCACGAA GGTCCCAGAG CCGCAAGGTG CACAGAGCAC 

481 CC TAT AAACC CTGCAGTCCT CTTTGAGTCA TGGGGAGGAC 

521 TGTGGATCCC CTTCTTTGAG TCTAAAGAGG CTCAGGAATC 

561 CATATTCTCC ACAAGGTCCA GGTGGGAAGT GCATGCATGA 

601 AATCAGACTC CAAAAAATGA ACCACAAGTC ACAGGAGAGA 

641 GAGAGAACTA GTCTCGAGCT CGTGCCGAAT TCGGCACGAG 

681 GTGTAGCTCA GGTAGCCCAG GGAGCAATCA AGCTCTGCTT 

721 CCTGCTGCGC AGCTGTGGCT CGCTCCTGCC CGAACTGAGT 

761 CTCGCCGAGA GGACAGAGTT TGCTCACAAG ATCTGGGACA 

801 AACTTCAGCA GTTAGGTGTC GTATATGATG TCAGTCATTA 

841 CAATGCTTTA CTTAAAGTAT ATCTTCAAAA TGAATACAAA 

881 TTTTCACCTA CTGACTTCCT GGCAAAGATG GAGGGAGCAA 

921 ACATCCAACC AAATCGAGTA ACATACCAGA GGCTGATAGC 

961 TGCCTACTGT AATGTTGGGG ACATTGAAGG TGCCAGCAAG 

1001 ATCCTTGGAT TTATGAAAAC GAAAGACCTT CCGATCACAG 

1041 AGGGCGTGTT CAGTGCTCTC GTCACAGGGC ATGCGAGAGC 

1081 TGGGGATATG GAAAATGCAG AAAATATTCT CACAGTGATG 

1121 AAACAGGCCG GCATTGAGCC TGGCCCAGAC ACGTATCTGG 

1161 CCTTGTTGAA TGCACATGCT GAGAGGGGTG ACATTGGCCA 

1201 GGTTAGGCAG ATTCTGGAGA AAGTGGAGAA GTCAGACCAT 

1241 TACTTCATGG ACCGCGACTT CTTGCAGGTT ATTTTTAGCT 

1281 TCAGTAAGGC TGGCTACCCT CACTCGAG 
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